Abstract. We strengthen the Carleson-Hunt theorem by proving $L^p$ estimates for the $r$-variation of the partial sum operators for Fourier series and integrals, for $p > \max\{r', 2\}$. Four appendices are concerned with transference, a variation norm Menshov-Paley-Zygmund theorem, and applications to nonlinear Fourier transforms and ergodic theory.



1. Introduction 

For an integrable function $f$ on the circle group $\mathbb{T} = \mathbb{R}/\mathbb{Z}$, and $k \in \mathbb{Z}$ we denote by $\hat f_k = \int_0^1 f(y) e^{-2\pi i k y}\,dy$ the Fourier coefficients and consider the partial sum operators $S_n$ for the Fourier series,

(1) $S_n f(x) \equiv S[f](n,x) = \sum_{k=-n}^{n} \hat f_k e^{2\pi i k x}$;

here $n \in \mathbb{N}_0 = \{0, 1, 2, \dots\}$. The celebrated theorem by Carleson [1] states that if $f$ is square integrable then $S_n f$ converges to $f$ almost everywhere. Hunt [13] extended this result to $L^p(\mathbb{T})$ functions, for $1 < p < \infty$, and proved the inequality

(2) $\|\sup_n |S_n f|\,\|_{L^p(\mathbb{T})} \le C \|f\|_{L^p(\mathbb{T})}$

for all $f \in L^p(\mathbb{T})$; see also [10], [20], and [12] for other proofs of this fact.

The purpose of this paper is to strengthen the Carleson-Hunt result for $L^p$ functions, $1 < p < \infty$, and show that, for $r > \max\{2, p'\}$, the (strong) $r$-variation of the sequence $\{S_n f(x)\}_{n \in \mathbb{N}_0}$ is finite for almost every $x \in [0, 1]$. This can be interpreted as a statement about the rate of convergence. To fix notation, we consider real or complex valued sequences $\{a_n\}_{n \in \mathbb{N}_0}$ and define their $r$-variation to be

(3) $\|a\|_{V^r} = \sup_K \sup_{n_0 < \dots < n_K} \Big( \sum_{\ell=1}^{K} |a_{n_\ell} - a_{n_{\ell-1}}|^r \Big)^{1/r}$

where the sup is taken over all $K$ and then over all increasing sequences of nonnegative integers $n_0 < \dots < n_K$. Note that the variation norms are monotone decreasing in the parameter $r$. Next, for a sequence $F = \{F_n\}$ of Lebesgue measurable functions one defines the $r$-variation of $F$ at $x$, sometimes denoted by $V^r F(x)$, as the $V^r$ norm of the sequence $\{F_n(x)\}$. We denote the $r$-variation of the sequence $F_n = S_n f$ by $V^r S[f]$. The variation norms and the $r$-variation operator can be defined in a similar fashion, if the index set $\mathbb{N}_0$ is replaced by another subset of $\mathbb{R}$ (often $\mathbb{R}^+$ or $\mathbb{R}$ itself).
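To make the definition of the $r$-variation concrete, the following minimal Python sketch (not part of the paper) evaluates the variation norm of a finite sequence by dynamic programming over the last index of the chain $n_0 < \dots < n_K$. The function name `r_variation` is an illustrative choice, and the sketch assumes $1 \le r < \infty$.

```python
def r_variation(a, r):
    """Variation norm of a finite real or complex sequence a, for 1 <= r < infinity.

    best[j] is the largest value of sum |a[n_l] - a[n_{l-1}]|**r over all
    increasing index chains n_0 < ... < n_K whose last index is n_K = j.
    """
    best = [0.0] * len(a)
    for j in range(len(a)):
        for i in range(j):
            best[j] = max(best[j], best[i] + abs(a[j] - a[i]) ** r)
    return max(best, default=0.0) ** (1.0 / r)


# An alternating sequence: every jump counts, and the norms decrease as r grows.
a = [0, 1, 0, 1, 0]
v1, v2, v4 = (r_variation(a, r) for r in (1, 2, 4))
```

For a monotone sequence the supremum is attained by the single jump from the first to the last term when $r > 1$, whereas an oscillating sequence such as the one above is penalized more heavily the smaller $r$ is.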

Let $r' := r/(r - 1)$, the conjugate exponent of $r$.

Theorem 1.1. Suppose $r > 2$ and $r' < p < \infty$. Then, for every $f \in L^p(\mathbb{T})$,

(4) $\|V^r S[f]\|_{L^p(\mathbb{T})} \le C_{p,r} \|f\|_{L^p(\mathbb{T})}$.

At the endpoint $p = r'$ a restricted weak type result holds; namely, for any $f \in L^{r',1}(\mathbb{T})$ the function $V^r S[f]$ belongs to $L^{r',\infty}(\mathbb{T})$.

It is immediate that (4) for $r < \infty$ implies a quantitative form of almost everywhere convergence of Fourier series, improving over the standard qualitative result utilizing the weaker $r = \infty$ inequality and convergence on a dense subclass of functions.

As will be discussed in Section 2, the conditions on the exponents in Theorem 1.1 are sharp. Moreover, in the endpoint case $p = r'$ the Lorentz space $L^{r',\infty}$ cannot be replaced by a smaller Lorentz space.

By standard transference arguments (see Appendix A) Theorem 1.1 is implied by a result on the partial (inverse) Fourier integral. The partial (inverse) Fourier integral of a Schwartz function $f$ on $\mathbb{R}$ is defined as

$S[f](\xi, x) = \int_{-\infty}^{\xi} \hat f(\eta) e^{2\pi i \eta x}\,d\eta$

where $\hat f(\eta) = \int f(y) e^{-2\pi i y \eta}\,dy$ defines the Fourier transform of $f$.

Theorem 1.2. Suppose $r > 2$. Then $S$ extends to a bounded operator $S : L^p \to L^p(V^r)$ for $r' < p < \infty$. Moreover $S$ maps $L^{r',1}$ boundedly to $L^{r',\infty}(V^r)$.

Note that if in the above definition of the mixed $L^p(V^r)$ spaces we interchange the order between integration in the $x$ variable and taking the supremum over the choices of $K$ and the points $\xi_0$ to $\xi_K$ so that these choices become independent of the variable $x$, then the estimates corresponding to Theorem 1.2 are weaker; they follow from a square function inequality of Rubio de Francia [30] for $p \ge 2$, see also [29] for a related endpoint result for $p < 2$, and [18] for a proof of Rubio de Francia's inequality which is closer to the methods of this paper.

While the concept of $r$-variation norm is at least as old as Wiener's 1920s paper on quadratic variation [34], variational estimates have been pioneered by D. Lépingle ([22]) who proved them for martingales. Simple proofs of Lépingle's result based on jump inequalities have been given by Pisier and Xu [27] and by Bourgain [1], and applications
to other families of operators in harmonic analysis such as families of averages and 
singular integrals have been considered in [1], and the subsequent papers [TJ], [2], [15] 
(cf. the bibliography of [15] for more references). Bourgain [1] used variation norm
estimates (or related oscillation estimates which are intermediate in difficulty between 
maximal and variation norm estimates) to prove pointwise convergence results without 
previous knowledge that pointwise convergence holds for a dense subclass of functions. 
Such dense subclasses of functions, while usually available in the setting of analysis on 
Euclidean space, are less abundant in the ergodic theory setting. In Appendix D we demonstrate the use of Theorem 1.2 in the setting of Wiener-Wintner type theorems as developed in [19]. We note that the Carleson-Hunt theorem has previously been generalized by using other norms in place of the variation norm, see for example the use of oscillation norms in [19], and the norms in [8], [9].

We are also motivated by the fact that variation norms are in certain situations more stable under nonlinear perturbation than supremum norms. For example one can deduce bounds for certain $r$-variational lengths of curves in Lie groups from the corresponding lengths of the "trace" of the curves in the corresponding Lie algebras, see Appendix C for definitions and details. What we have in mind is proving Carleson type theorems for nonlinear perturbations of the Fourier transform as discussed in [25], [26]. Unfortunately the naive approach fails and the ultimate goal remains unattained since we only know the correlation between lengths of the trace and the original curve for $r < 2$, while the variational Carleson theorem only holds for $r > 2$. Nonetheless, this method allows one to see that a variational version of the Christ-Kiselev theorem [6] follows from a variational Menshov-Paley-Zygmund theorem which we prove in Appendix B. The variational Carleson inequality can be viewed as an endpoint in this theory.

Our proof of Theorem 1.2 will follow the method of [20] as refined in [12]. Naturally one has to invoke variation norm results in the setting of individual trees, which is achieved by adapting D. Lépingle's result ([22]) to the setting of a tree. The authors initially had a proof of the case $p > 2$ and $r > p$ of Theorem 1.2 more akin to [20], while improvement to $r > 2$ for such $p$ provided a stumbling block. This stumbling block was removed by better accounting for trees of given energy, as described in the remarks leading to Proposition 4.3. In Section 3 we reduce the problem to that of bounding certain model operators which map $f$ to linear combinations of wave packets associated to collections of multitiles. In Section 5 we bound the model operators when the collection of multitiles is of a certain type called a tree; this bound is in terms of two quantities, energy and density, which are associated to the tree. These quantities are defined in Section 4 and an algorithm is given to decompose an arbitrary collection of multitiles into a union of trees with controlled energy and density. Section 6 contains two auxiliary estimates. All these ingredients are combined to complete the proof in Section 7.

Some notation. For two quantities $A$ and $B$ let $A \lesssim B$ denote the statement that $A \le CB$ for some constant $C$ (possibly depending on the parameters $p$ and $r$). The Lebesgue measure of a set $E$ is denoted either by $|E|$ or by meas$(E)$. The indicator function of $E$ is denoted by $1_E$. For a subset $E \subset \mathbb{R}$ and $a \in \mathbb{R}$ we set $a + E = E + a = \{x : x - a \in E\}$. If $I$ is a finite interval with center $c(I)$ we denote by $CI$ the $C$-dilate of $I$ with respect to its center, i.e. the set of all $x$ for which $c(I) + \frac{x - c(I)}{C} \in I$.



2. Optimality of the exponents 

Since Theorem 1.2 implies Theorem 1.1 (cf. Appendix A) we have to discuss the optimality only for the Fourier series case. The necessity of the condition $r > 2$ follows from a corresponding result for the Cesàro means; its proof by Jones and Wang [16] was based on a probabilistic result of Qian [28].

We show the necessity of the condition $p > r'$ in Theorem 1.1. Let

$D_n(x) = \sum_{k=-n}^{n} e^{2\pi i k x} = \frac{\sin((2n+1)\pi x)}{\sin \pi x}$

be the Dirichlet kernel, and let $f_N$ be the de la Vallée-Poussin kernel which is defined by $f_N = 2K_{2N+1} - K_N$ via the Fejér kernel $K_N = (N+1)^{-1} \sum_{j=0}^{N} D_j$. Then the Fourier coefficients of $f_N$ satisfy $(\hat f_N)_k = 1$ for $|k| \le N+1$ and thus $S_n f_N = D_n$ for $n \le N+1$. We have $\|f_N\|_{L^1(\mathbb{T})} = O(1)$, and $\|f_N\|_\infty = O(N)$ and therefore $\|f_N\|_{L^{p,1}(\mathbb{T})} = O(N^{1-1/p})$.
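The claim about the Fourier coefficients of the de la Vallée-Poussin kernel can be checked by hand, and the short Python sketch below does the same bookkeeping in exact arithmetic. It only uses the standard fact that the $k$-th coefficient of the Fejér kernel $K_M$ is $1 - |k|/(M+1)$ for $|k| \le M$ and zero otherwise; the function names are illustrative and not taken from the paper.

```python
from fractions import Fraction


def fejer_coefficient(M, k):
    """k-th Fourier coefficient of the Fejer kernel K_M = (M+1)^{-1} sum_{j=0}^{M} D_j."""
    return max(Fraction(0), 1 - Fraction(abs(k), M + 1))


def vallee_poussin_coefficient(N, k):
    """k-th Fourier coefficient of f_N = 2 K_{2N+1} - K_N."""
    return 2 * fejer_coefficient(2 * N + 1, k) - fejer_coefficient(N, k)


N = 7
# Coefficients equal 1 on the whole range |k| <= N + 1 ...
flat = all(vallee_poussin_coefficient(N, k) == 1 for k in range(-(N + 1), N + 2))
# ... and vanish once |k| >= 2N + 2.
vanishes = all(vallee_poussin_coefficient(N, k) == 0 for k in (2 * N + 2, -(2 * N + 2), 3 * N))
```

Since the coefficients are identically 1 for $|k| \le N+1$, truncating the Fourier series of $f_N$ at any $n \le N+1$ reproduces the Dirichlet kernel $D_n$, which is what the counterexample uses.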

Let N > 10^ and 8N^'^ <x< 1/8. Let K = K{x) be the largest integer < Nx. Then 
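for $0 \le k < 2K(x)$ there are integers $n_k(x) \le N$ so that $(2n_k(x)+1)x \in (\frac14 + k, \frac34 + k)$, in particular $n_k(x) < n_{k+1}(x)$ for $k \le 2K(x) - 1$. Observe $\sin((2n_{2j}(x)+1)\pi x) > \sqrt2/2$ and $\sin((2n_{2j+1}(x)+1)\pi x) < -\sqrt2/2$ for $0 \le j \le K(x) - 1$. This gives

$\Big( \sum_{j=0}^{K(x)-1} |S_{n_{2j+1}(x)} f_N(x) - S_{n_{2j}(x)} f_N(x)|^r \Big)^{1/r} = \Big( \sum_{j=0}^{K(x)-1} |D_{n_{2j+1}(x)}(x) - D_{n_{2j}(x)}(x)|^r \Big)^{1/r} \ge \frac{\sqrt2\, K(x)^{1/r}}{\sin(\pi x)}$

and this implies for large $N$

$\frac{\|V^r(Sf_N)\|_{L^{p,s}}}{\|f_N\|_{L^{p,1}}} \ge c \begin{cases} N^{1/p - 1/r'} & \text{if } p < r', \\ (\log N)^{1/s} & \text{if } p = r'. \end{cases}$

Thus the $L^p \to L^p(V^r)$ boundedness does not hold for $p < r'$; moreover the $L^{r',1} \to L^{r',s}(V^r)$ boundedness does not hold for $s < \infty$.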



3. The model operators 



We shall show in Appendix A how to deduce Theorem 1.1 from Theorem 1.2. To start the proof of the main Theorem 1.2 we describe some reductions to model operators involving wave packet decompositions.

First, by interpolation it suffices to prove for $p > r'$ the restricted weak type $L^{p,1} \to L^{p,\infty}(V^r)$ bound. Next, by the monotone convergence theorem it suffices to estimate $L^{p,\infty}(V^r)$ on finite $x$-intervals $[-A, A]$, with constant independent of $A$. By another application of the monotone convergence theorem it suffices, for any fixed $K$, to prove the $L^{p,1} \to L^{p,\infty}([-A,A])$ bound for

(5) $\sup \Big( \sum_{\ell=1}^{K} |S[f](\xi_\ell, x) - S[f](\xi_{\ell-1}, x)|^r \Big)^{1/r}$

where the sup is taken over all $(\xi_0, \dots, \xi_K)$ with $\xi_{\ell-1} \le \xi_\ell$ for $\ell = 1, \dots, K$. Moreover, by the density of Schwartz functions in $L^{p,1}$ it suffices to prove a uniform estimate for all Schwartz functions. Note that for any Schwartz function $f$ the expression $S[f](\xi,x)$ depends continuously on $(\xi,x)$. Therefore it suffices to bound the expression analogous to (5) where we impose the strict inequality $\xi_{\ell-1} < \xi_\ell$ for $\ell = 1, \dots, K$. Moreover, by the continuity it suffices for each finite set $H \subset \mathbb{R}$ to prove bounds for this expression under the assumption that the $\xi_\ell$ belong to $H$, and we may also assume that $H$ does not contain any numbers of the form $n2^m$ with $m,n \in \mathbb{Z}$ (i.e. no endpoints of dyadic intervals).

We may now linearize the variation norm. Fix $K \in \mathbb{N}$, measurable real valued functions $\xi_0(x) < \dots < \xi_K(x)$, with values in $H$ and measurable complex valued functions $a_1(x), \dots, a_K(x)$ satisfying

$|a_1(x)|^{r'} + \dots + |a_K(x)|^{r'} = 1$.

Let

$S'[f](x) = \sum_{k=1}^{K} \big( S[f](\xi_k(x), x) - S[f](\xi_{k-1}(x), x) \big) a_k(x)$.

Theorem 1.2 will now follow from the estimate

(6) $\|S'[f]\|_{L^{p,\infty}(\mathbb{R})} \le C \|f\|_{L^{p,1}(\mathbb{R})}$

where $C$ is independent of $H$ and the linearizing functions, and where $f$ is any Schwartz function. Finally, in order to prove (6) for any fixed $H$ we may assume that $\hat f$ has compact support in $\mathbb{R} \setminus H$ since the space of Schwartz functions with this property is dense in $L^{p,1}$, $1 < p < \infty$.

Let $\mathcal{D} = \{[2^k m, 2^k(m+1)) : m,k \in \mathbb{Z}\}$ be the set of dyadic intervals. A tile will be any rectangle $I \times \omega$ where $I, \omega$ are dyadic intervals, and $|I||\omega| = 1/2$. We will write $S'$ as the sum of wave packets adapted to tiles, and then decompose the operator into a finite sum of model operators by sorting the wave packets into a finite number of classes. For each $k$,

$S[f](\xi_k, x) - S[f](\xi_{k-1}, x) = \int 1_{(\xi_{k-1},\xi_k)}(\eta) \hat f(\eta) e^{2\pi i \eta x}\,d\eta$.

To suitably express the difference above as a sum of wave packets, we will first need to construct a partition of $1_{(\xi_{k-1},\xi_k)}$ adapted to certain dyadic intervals. The fact that $(\xi_{k-1},\xi_k)$ has two boundary points instead of the one from $(-\infty,\xi_k)$ will necessitate a slightly more involved discretization argument than that in [20].

For any $\xi < \xi'$ let $\mathcal{J}_{\xi,\xi'}$ be the set of maximal dyadic intervals $J$ such that $J \subset (\xi,\xi')$ and $\mathrm{dist}(J,\xi) > |J|$, $\mathrm{dist}(J,\xi') > |J|$. Let $\nu$ be a $C^\infty$ function from $\mathbb{R}$ to $[0,1]$ which
vanishes on (— 00,— 10~^], is identically equal to 1 on [10~^,oo), and so that i^'ix) > 
for — 10~^ < X < 10~^. Given an interval J = [a,b), and i £ {—1,0, 1}, define 

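$\varphi_{J,i}(\xi) = \nu\Big(\frac{\xi - a}{2^i |J|}\Big) - \nu\Big(\frac{\xi - b}{|J|}\Big)$.

Thus if $c(J)$ is the center of $J$, then

(7) $\varphi_{J,i}(\xi) = \psi_i\Big(\frac{\xi - c(J)}{|J|}\Big)$ where $\psi_i(\eta) = \nu(2^{-i}(\eta + \tfrac12)) - \nu(\eta - \tfrac12)$,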

and we notice for i E {—1,0, 1} both and are C°° functions supported in [— ^, ^] 
(more precisely in [— ^])- Hence ifj^i is supported on a ||-dilate of J with respect 
to its center. 

For each $J \in \mathcal{J}_{\xi,\xi'}$, one may check that there is a unique interval $J' \in \mathcal{J}_{\xi,\xi'}$ which lies strictly to the left of $J$ and satisfies $\mathrm{dist}(J', J) = 0$, and one may check that $J'$ has size $|J|/2$, $|J|$, or $2|J|$. We define $\varphi_J = \varphi_{J,i(J)}$ where $i(J)$ is chosen so that $|J'| = 2^{i(J)}|J|$.

Then

(8) $1_{(\xi,\xi')}(\eta) \hat f(\eta) = \sum_{J \in \mathcal{J}_{\xi,\xi'}} \varphi_J(\eta) \hat f(\eta)$.

Since we assume that $\hat f$ is compactly supported in $\mathbb{R} \setminus H$ we see that for every pair $\xi < \xi'$ with $\xi, \xi' \in H$ only a finite number of dyadic intervals $J \in \mathcal{J}_{\xi,\xi'}$ are relevant in (8).
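We now write each multiplier $\varphi_J$ as the sum of wave packets. For every tile $P = I \times J$, define

$\phi_P(x) = |I|^{1/2} \mathcal{F}^{-1}[\sqrt{\varphi_J}](x - c(I))$

where $c(I)$ is the center of $I$ and $\mathcal{F}^{-1}$ denotes the inverse Fourier transform. For each $J$, we then have

(9) $\sum_{|I| = 1/(2|J|)} \langle f, \phi_{I \times J} \rangle \phi_{I \times J} = \mathcal{F}^{-1}[\varphi_J \hat f]$.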

To see this we use a Fourier series expansion (c/. |33j). We first observe that 4>p{^) = 
\/RT V (pj{^)e~'^'^^^^^^^ and use {f,(f>p) = (/,^p). Now let parametrize the centers of the 
dyadic intervals / of length L by -{k-\)L, k G Z. Set gj{uj) := [^/^/](c(J)+L-ia;)e™ 



2 

and note that 5j is supported in [— ^n, -M]- The left hand side of ([9]) is equal to 



7,. ^ 



2m{kL-^)e, 



, [-1/2 

which gives (9). This in turn yields the representation of $S'[f]$ in terms of wave packets:

(10) $S'[f](x) = \sum_{k=1}^{K} \Big( \sum_{J \in \mathcal{J}_{\xi_{k-1}(x),\xi_k(x)}} \sum_{|I| = 1/(2|J|)} \langle f, \phi_{I \times J} \rangle \phi_{I \times J}(x) \Big) a_k(x)$.

For the function $f$ under consideration the above Fourier series expansion converges in $L^2$-Sobolev spaces of arbitrary high order and thus the convergence in (10) is uniform for $x \in [-A, A]$. Therefore it suffices, for any finite family $\mathfrak{P}$ of tiles, to consider the operator $S''$ defined by

(11) $S''[f](x) = \sum_{k=1}^{K} \Big( \sum_{(I,J) \in \mathfrak{P},\ J \in \mathcal{J}_{\xi_{k-1}(x),\xi_k(x)}} \langle f, \phi_{I \times J} \rangle \phi_{I \times J}(x) \Big) a_k(x)$.



The wave packets will be sorted into a finite number of classes, each well suited for further analysis. Sorting is accomplished by dividing every $\mathcal{J}_{\xi,\xi'}$ into a finite number of disjoint sets. These sets will be indexed by a fixed subset of $\{1,2,3\} \times \{1,2,3,4\}^2 \times \{\text{left}, \text{right}\}$. Specifically, for each $(m,n,\text{side}) \in \{1,2,3,4\}^2 \times \{\text{left}, \text{right}\}$, we define

• $\mathcal{J}_{\xi,\xi',(1,m,n,\text{side})} = \{J \in \mathcal{D} : J \subset (\xi,\xi')$, $\xi$ is in the interval $J - (m+1)|J|$, $\xi'$ is in the interval $J + (n+1)|J|$, and $J$ is the side-child of its dyadic parent$\}$.

• $\mathcal{J}_{\xi,\xi',(2,m,n,\text{side})} = \{J \in \mathcal{D} : J \subset (\xi,\xi')$, $\xi$ is in the interval $J - (m+1)|J|$, $\mathrm{dist}(\xi', J) > n|J|$, and $J$ is the side-child of its dyadic parent$\}$.

• $\mathcal{J}_{\xi,\xi',(3,m,n,\text{side})} = \{J \in \mathcal{D} : J \subset (\xi,\xi')$, $\mathrm{dist}(\xi, J) > m|J|$, $\xi'$ is in the interval $J + (n+1)|J|$, and $J$ is the side-child of its dyadic parent$\}$.

We will choose $R \subset \{1,2,3\} \times \{1,2,3,4\}^2 \times \{\text{left}, \text{right}\}$ so that for each $\xi < \xi'$ the collection $\{\mathcal{J}_{\xi,\xi',\rho}\}_{\rho \in R}$ is pairwise disjoint and $\mathcal{J}_{\xi,\xi'} = \bigcup_{\rho \in R} \mathcal{J}_{\xi,\xi',\rho}$. We will also assume that for each $\rho \in R$ there is an $i(\rho) \in \{-1, 0, 1\}$ such that $|J'| = 2^{i(\rho)}|J|$ for every $\xi < \xi'$, $J \in \mathcal{J}_{\xi,\xi',\rho}$ and $J' \in \mathcal{J}_{\xi,\xi'}$ with $J'$ strictly to the left of $J$ and $\mathrm{dist}(J, J') = 0$. One may check that these conditions are satisfied, say, for

R = {(1, 2, 1, left), (1, 2, 2, left), (1, 3, 1, left),

(1, 3, 2, left), (2, 1, 1, left), (2, 1, 1, right), (2, 2, 1, right),

(3, 4, 1, left), (3, 3, 1, right), (3, 4, 2, left)}.

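It now follows that

$S''[f] = \sum_{\rho \in R} S^\rho[f]$

where

$S^\rho[f](x) = \sum_{k=1}^{K} \Big( \sum_{(I,J) \in \mathfrak{P},\ J \in \mathcal{J}_{\xi_{k-1}(x),\xi_k(x),\rho}} \langle f, \phi_{I \times J} \rangle \phi_{I \times J}(x) \Big) a_k(x)$.

It will be convenient to rewrite each operator $S^\rho$ in terms of multitiles. A multitile will be a subset of $\mathbb{R}^2$ of the form $I \times \omega$ where $I \in \mathcal{D}$ and where $\omega$ is the union of three intervals $\omega_l$, $\omega_u$, $\omega_h$ in $\mathcal{D}$. For each $\rho = (i,m,n,\text{side}) \in R$, we consider a set of $\rho$-multitiles which is parameterized by $\{(I,\omega_u) : I, \omega_u \in \mathcal{D}$, $|I||\omega_u| = 1/2$, and $\omega_u$ is the side-child of its parent$\}$. Specifically, given $\omega_u = [a, b)$:

• If $\rho = (1, m, n, \text{side})$ then $\omega_l = \omega_u - (m+1)|\omega_u|$ and $\omega_h = \omega_u + (n+1)|\omega_u|$.

• If $\rho = (2, m, n, \text{side})$ then $\omega_l = \omega_u - (m+1)|\omega_u|$ and $\omega_h = [a + (n+1)|\omega_u|, \infty)$.

• If $\rho = (3, m, n, \text{side})$ then $\omega_l = (-\infty, b - (m+1)|\omega_u|)$ and $\omega_h = \omega_u + (n+1)|\omega_u|$.

For $i = 1, 2, 3$ we shall say that $\rho$ is an $i$-index if $\rho = (i, m, n, \text{side})$. For every $\rho$-multitile $P$, let $a_P(x) = a_k(x)$ if $k$ satisfies $1 \le k \le K$ and $\xi_{k-1}(x) \in \omega_l$ and $\xi_k(x) \in \omega_h$ (such a $k$ would clearly be unique), and $a_P(x) = 0$ if there is no such $k$. Then,

$S^\rho[f](x) = \sum_{P \in \mathbf{P}_\rho} \langle f, \phi_P \rangle \phi_P(x) a_P(x)$

where, for each $\rho$-multitile $P$, $\phi_P(x) = \sqrt{|I|}\, \mathcal{F}^{-1}[\sqrt{\varphi_{\omega_u, i(\rho)}}](x - c(I))$ and $\mathbf{P}_\rho$ denotes the set of all $\rho$-multitiles for which $I \times \omega_u$ belongs to $\mathfrak{P}$.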

Inequality (6) and hence Theorem 1.2 will then follow after proving the bound

(12) $\|S^\rho[f]\|_{L^{p,\infty}} \lesssim \|f\|_{L^{p,1}}$

for each $\rho \in R$. We shall only give the proof of this estimate for the case that $\rho$ is a 1-index or $\rho$ is a 2-index, and the case where $\rho$ is a 3-index can be deduced by symmetry considerations. Indeed, if $P = (I,\omega_u) := ([a,b),[c,d))$ and $\tilde P := ([-b,-a),[-d,-c))$ then $\langle f, \phi_P \rangle \phi_P = \langle f(-\cdot), \tilde\phi_{\tilde P} \rangle \tilde\phi_{\tilde P}(-\cdot)$ where, in the definition of the $\tilde\phi_{\tilde P}$ the function $\psi_i$ in (7)
is replaced with •) (both are supported in (— ^i^))- Now reflection sends a half 
open interval $[a,b)$ to a half open interval $(-b,-a]$, however this plays no role in our symmetry argument if, as we do, we assume that the set $H$ does not contain endpoints of dyadic intervals. We then see that the estimation of $S^\rho[f]$ for $\rho = (3,m,n,\text{side})$ is equivalent to the estimation of a $S^{\tilde\rho}[f(-\cdot)]$ where the corresponding set $\{P\}$ of multitiles is replaced with a set $\{\tilde P\}$ of index $\tilde\rho = (2, n, m, \text{opposite side})$ and the set $H$ is replaced with $\{\xi : -\xi \in H\}$. Beginning with (14) both $\psi_i$ or $\psi_i(-\cdot)$ are allowed in the definition of the functions $\varphi_{J,i}$ and $\phi_P$.

By the usual characterization of $L^{p,1}$ as superpositions of functions bounded by characteristic functions it suffices to show that

meas$(\{x : |S^\rho[f](x)| > \lambda\}) \le C^p \lambda^{-p} |F|$

where $F \subset \mathbb{R}$, $|F| > 0$, $|f| \le 1_F$, $\lambda > 0$, $2 < r < \infty$, and $r' < p < (1/2 - 1/r)^{-1}$. This is accomplished by proving that for every measurable $E \subset \mathbb{R}$,

(13) meas$(\{x \in E : |S^\rho[f](x)| > C(|F|/|E|)^{1/p}\}) \le |E|/2$.

Indeed, if we set $E_\lambda = \{x : |S^\rho[f](x)| > \lambda\}$ then by the finiteness of the set of tiles under consideration the set $E_\lambda$ has a priori finite measure. If $|E_\lambda| \le C^p \lambda^{-p}|F|$ then there is nothing to prove. If the opposite inequality $|E_\lambda| > C^p \lambda^{-p}|F|$ were true then $\lambda > C(|F|/|E_\lambda|)^{1/p}$ and inequality (13) applied to $E = E_\lambda$ would yield that $|E_\lambda| \le |E_\lambda|/2$, a contradiction.

We finally note that, after possibly rescaling, we may assume that $1 \le |E| \le 2$ in (13). The next four sections will be devoted to the proof of inequality (13) in this case.



4. Energy and density 

Recall that $S^\rho[f](x) = \sum_P \langle f, \phi_P \rangle \phi_P(x) a_P(x)$ where $P$ ranges over an arbitrary finite collection of $\rho$-multitiles, $\rho$ is a 1- or 2-index. It is our goal to show (13) and for this and the next chapter we fix the function $f$ with $|f| \le 1_F$ and the set $E$.

Fix $1 < C_3 < C_2 < C_1$, with $C_2 \in \mathbb{N}$, such that for every multitile $P$,

supp$(\hat\phi_P) \subset C_3\omega_u$,

$C_2\omega_u \cap C_2\omega_l = \emptyset$, $C_2\omega_u \cap \omega_h = \emptyset$,

$C_2\omega_l \subset C_1\omega_u$, $C_2\omega_u \subset C_1\omega_l$;

recall that dilations of finite intervals are with respect to their center. One may check that the values $C_3 = 11/10$, $C_2 = 2$, and $C_1 = 12$ satisfy all these properties.
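The purely geometric conditions on $C_2$ and $C_1$ can be verified mechanically. The Python sketch below is an illustration only: it treats intervals as half-open pairs, normalizes $\omega_u = [0,1)$ (the conditions are invariant under translation and scaling), and assumes the shapes of $\omega_l$ and $\omega_h$ given for 1- and 2-indices in Section 3. The support condition involving $C_3$ depends on the cutoff function and is not checked; all helper names are hypothetical.

```python
from fractions import Fraction


def dilate(interval, c):
    """c-dilate of a finite interval (lo, hi) with respect to its center."""
    lo, hi = interval
    center, half = (lo + hi) / 2, (hi - lo) / 2
    return (center - c * half, center + c * half)


def disjoint(a, b):
    """Disjointness of half-open intervals [lo, hi)."""
    return a[1] <= b[0] or b[1] <= a[0]


def contained(a, b):
    return b[0] <= a[0] and a[1] <= b[1]


C1, C2 = Fraction(12), Fraction(2)
omega_u = (Fraction(0), Fraction(1))


def check(m, n):
    omega_l = (omega_u[0] - (m + 1), omega_u[1] - (m + 1))
    omega_h_left = omega_u[0] + (n + 1)   # left endpoint of omega_h for 1- and 2-indices
    return (disjoint(dilate(omega_u, C2), dilate(omega_l, C2))
            and dilate(omega_u, C2)[1] <= omega_h_left
            and contained(dilate(omega_l, C2), dilate(omega_u, C1))
            and contained(dilate(omega_u, C2), dilate(omega_l, C1)))


all_ok = all(check(m, n) for m in range(1, 5) for n in range(1, 5))
```

With these values the inclusions are tight for $m = 4$, and the same check fails for $m = 5$, which is consistent with the restriction of $m$ to $\{1,2,3,4\}$.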

The wave packet $\phi_P$ is adapted to the multitile $P$. As $\hat\phi_P$ is compactly supported (in $C_3\omega_u$) the function $\phi_P$ cannot have compact support, but as a replacement we have the following bounds involving

$w_I(x) := \frac{1}{|I|} \Big(1 + \frac{|x - c(I)|}{|I|}\Big)^{-N}$

for a fixed large $N \ge 10$; namely

(14) $\Big| \frac{d^n}{dx^n} \big( \exp(-2\pi i c(\omega_u) \cdot) \phi_P \big)(x) \Big| \le C(n) |I|^{(1/2) - n} w_I(x)$

for each $n \ge 0$.

We are working with a given finite set of $\rho$-multitiles $\mathbf{P}$ (with $\rho$ a 1- or 2-index) and we let $M_0 = M_0(\mathbf{P})$ be the smallest integer $M$ for which all tiles are contained in the square $[-2^M, 2^M]^2$. Throughout this paper we fix

$H^{\mathrm{top}} = \{\eta : |\eta| \le C_1 2^{M_0+10},\ \eta = n2^{-M_0-10} \text{ for some } n \in \mathbb{Z}\}$

as the set of admissible top-frequencies for trees, as in the following definition. 



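Definition. Consider a triple $\mathcal{T} = (\mathbf{T}_{\mathcal{T}}, I_{\mathcal{T}}, \xi_{\mathcal{T}})$ with a set of multitiles $\mathbf{T}_{\mathcal{T}}$, a dyadic interval $I_{\mathcal{T}} \subset [-2^{M_0}, 2^{M_0})$ and a point $\xi_{\mathcal{T}} \in H^{\mathrm{top}}$. We say that $\mathcal{T}$ is a tree if the following properties are satisfied.

(i) $I \subset I_{\mathcal{T}}$ for all $P = (I,\omega_u) \in \mathbf{T}_{\mathcal{T}}$.

(ii) If $P = (I,\omega_u)$ and $\omega_m$ denotes the convex hull of $C_2\omega_u \cup C_2\omega_l$ then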

■■= [er-f^,er+^) 

is contained in $\omega_m$.

We refer to $I_{\mathcal{T}}$ as the top interval of the tree, and to $\xi_{\mathcal{T}}$ as the top frequency of the tree.

In order not to overload the notation we usually refer to the set $T$ as "the" tree (keeping in mind that it carries additional information of a top frequency and a top interval), and we shall also use the notation $I_T$, $\xi_T$ and $\omega_T$ in place of $I_{\mathcal{T}}$, $\xi_{\mathcal{T}}$ and $\omega_{\mathcal{T}}$. With this convention we also define

Definition.

(i) A tree $(T, I_T, \xi_T)$ is $l$-overlapping if $\xi_T \in C_2\omega_l$ for every $P \in T$.
(ii) A tree $(T, I_T, \xi_T)$ is $l$-lacunary if $\xi_T \notin C_2\omega_l$ for every $P \in T$.

Notice that the union of two trees with the top data $I_T, \xi_T$ is again a tree with the same top data. Also, the union of two $l$-overlapping trees with the same top data is again an $l$-overlapping tree with the same top data.

We split our finite collection of multitiles into a bounded number of subcollections satisfying certain separation conditions (i.e. henceforth all multitiles will be assumed to belong to a fixed subcollection).

Separation assumptions. 

(15) If P,P' satisfy \uj'J < \uju\, then \uj'J < —— — |a;„|. 

(16) If $P, P'$ satisfy $C_1\omega_u \cap C_1\omega'_u \ne \emptyset$ and $|\omega_u| = |\omega'_u|$, then $\omega_u = \omega'_u$.

An immediate but important consequence of the separation assumptions is the frequently used

Observation 4.1. Let $T$ be a tree satisfying the separation properties (15) and (16).

Then the following properties hold.

(i) If $T$ is an $l$-overlapping tree, $P, P' \in T$, and $|\omega'_u| < |\omega_u|$ then $C_3\omega_u \cap C_3\omega'_u = \emptyset$.
(ii) If $T$ is an $l$-lacunary tree, $P, P' \in T$, and $|\omega'_u| < |\omega_u|$ then $C_3\omega_l \cap C_3\omega'_l = \emptyset$.
(iii) If $P, P' \in T$, $P \ne P'$, and $|\omega_u| = |\omega'_u|$, then $I \cap I' = \emptyset$.



As in previous proofs of Carleson's theorem (in particular [20]) we shall split the set of multitiles into subsets with controllable energy and density associated to the function $f$ and the set $E$, respectively. Here we work with the following definitions.

Definition. Fix $f \in L^2(\mathbb{R})$ and a measurable set $E \subset \mathbb{R}$. Given any collection of multitiles $\mathbf{P}$ we define

energy$(\mathbf{P}) = \sup_T \Big( \frac{1}{|I_T|} \sum_{P \in T} |\langle f, \phi_P \rangle|^2 \Big)^{1/2}$

where the sup ranges over all $l$-overlapping trees $T \subset \mathbf{P}$.
Given a measurable set $E \subset \mathbb{R}$ we set



/I /■ / \x - c(It)\\-'^-J^ I 
density(P) = sup (l + J \!^) Yl l"'^^^)!'' l-rfe-iC^)) dx\ 



k=l 

where the sup is over all non-empty trees $T \subset \mathbf{P}$.

Remark. Concerning the terminology, one can argue that the square root should be omitted in the definition of an energy. However we work with the above definition to conform to [20] and other papers in time-frequency analysis.

Lemma 4.2. Let $|f|$ be bounded by $1_F$. For any family $\mathbf{P}$ of multitiles the density of $\mathbf{P}$ (with respect to the set $E$) and the energy (with respect to $f$) are bounded by a universal constant.



Proof. Clearly the density is bounded by f^{l + dx < 3. Concerning the energy 

bound we let $T$ be any $l$-overlapping tree, and split $f = f' + f''$ where $f' = 1_{3I_T} f$.
We estimate Eperlif ,<Pp)\'' < ll/'IU^ (Eper K/'> 'Ap)!')'/', by the Cauchy-Schwarz 
inequality. Now use that the supports of the (pp are disjoint for different sizes of frequency 
intervals, and then, for a fixed size, use the bounds (jl4p (for n = 0) to see that 

2 ^1/2 ^ ^ ^ ^ ^\l/2 



L2 



Per jez Per pgt 

Hence, 

Per 
Furthermore, since $|f''| \le 1_{\mathbb{R} \setminus 3I_T}$ we have the estimate

\{f",M\ < 1/1^/^(1 + dist(/,M\3/T)/|/|)"(^"') < \I\'/\\I\/\lT\f"' 
Summing in P, we obtain 

Per 

Combining the estimates for $f'$ and $f''$ we see that $\sum_{P \in T} |\langle f, \phi_P \rangle|^2 \lesssim |I_T|$ for every $l$-overlapping tree and it follows that the energy of $\mathbf{P}$ with respect to $f$ is bounded above by a universal constant. □



The following proposition allows one to decompose an arbitrary collection of multitiles 
into the union of trees, where the trees are divided into collections Tj with the energy 
of trees from Tj bounded by . The control over energy is balanced by an L'^ bound 
for the functions $N_{j,\ell} := \sum_{T \in \mathbf{T}_j} 1_{2^\ell I_T}$. In contrast to [20] and [12], it is necessary here to consider $q > 1$ and $\ell > 0$ in order to effectively use the tree estimate Proposition 5.1 with $q > 1$. Note that such $L^q$ bound for $N_{j,\ell}$ is established by combining (17) and (18) below. The bound (20) permits one to make further decompositions to take advantage of large $|F|$ in the $L^q$ bound for the $N_{j,\ell}$ while maintaining compatibility with bounds
for trees with a fixed density obtained from Proposition 



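Proposition 4.3. Let $\mathcal{E} > 0$, let $|f|$ be bounded above by $1_F$ and let $\mathbf{P}$ be a collection of multitiles with energy bounded above by $\mathcal{E}$. Then, there is a collection of trees $\mathbf{T}$ such that

(17) $\sum_{T \in \mathbf{T}} |I_T| \lesssim \mathcal{E}^{-2} |F|$

and

energy$\big(\mathbf{P} \setminus \bigcup_{T \in \mathbf{T}} T\big) \le \mathcal{E}/2$

and such that, for every integer $\ell \ge 0$,

(18) $\Big\| \sum_{T \in \mathbf{T}} 1_{2^\ell I_T} \Big\|_{BMO} \lesssim 2^{2\ell} \mathcal{E}^{-2}$.

Furthermore, if for some collection of trees $\mathbf{T}'$,

(19) $\mathbf{P} = \bigcup_{T' \in \mathbf{T}'} T'$

then

(20) $\sum_{T \in \mathbf{T}} |I_T| \lesssim \sum_{T' \in \mathbf{T}'} |I_{T'}|$.

Above, and subsequently, $\| \cdot \|_{BMO}$ denotes the dyadic BMO norm.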



Proof. We select trees through an iterative procedure. First, if energy$(\mathbf{P}) \le \mathcal{E}/2$ then no tree is chosen and $\mathbf{T} = \emptyset$.

If energy$(\mathbf{P}) > \mathcal{E}/2$ then we observe that there is an $l$-overlapping tree $S \subset \mathbf{P}$ for which

(21) $\frac{1}{|I_S|} \sum_{P \in S} |\langle f, \phi_P \rangle|^2 > \mathcal{E}^2/4$.

There is only a finite number of such trees and as $S_1$ we choose one for which the top datum $\xi_S$ is maximal (in $\mathbb{R}$). Note that the maximality can be achieved as we restrict all top frequency data to the finite set $H^{\mathrm{top}}$. Let $T_1$ be the tree in $\mathbf{P}$ which has top data $(\xi_{S_1}, I_{S_1})$ and which is maximal with respect to inclusion.

Suppose that trees $S_k, T_k$ have been chosen for $k = 1, \dots, j$. Set

$\mathbf{P}_j = \mathbf{P} \setminus \bigcup_{k=1}^{j} T_k$.

If energy$(\mathbf{P}_j) \le \mathcal{E}/2$ then we terminate the procedure, set $\mathbf{T} = \{T_k\}_{1 \le k \le j}$ and $n = j$. Otherwise, we may find an $l$-overlapping tree $S \subset \mathbf{P}_j$ such that (21) holds. Among the $l$-overlapping trees in $\mathbf{P}_j$ satisfying (21) choose one with maximal top frequency (in $H^{\mathrm{top}}$) and label this tree $S_{j+1}$. Let $T_{j+1}$ be the tree in $\mathbf{P}_j$ which has top data $(\xi_{S_{j+1}}, I_{S_{j+1}})$ and which is maximal with respect to inclusion. This process will eventually stop since each $T_j$ is nonempty and $\mathbf{P}$ is finite.
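The control flow of this selection procedure can be summarized in a few lines of Python. The sketch below is a toy model under loud assumptions, not the construction itself: multitiles are opaque labels, the list `overlapping_trees` is a hypothetical precomputed enumeration of all $l$-overlapping trees as triples (top frequency, top interval, members), and `maximal_tree` is a hypothetical lookup for the maximal tree with given top data. None of the time-frequency geometry is modelled.

```python
def select_trees(tiles, coeff_sq, overlapping_trees, maximal_tree, energy_bound):
    """Greedy selection: repeatedly pick a heavy overlapping tree with the largest
    top frequency, then remove the maximal tree with the same top data."""
    remaining = set(tiles)
    selected = []
    while True:
        # Overlapping trees inside the remaining collection that satisfy the analogue of (21).
        heavy = [
            (xi, I, S) for (xi, I, S) in overlapping_trees
            if S and S <= remaining
            and sum(coeff_sq[P] for P in S) > energy_bound ** 2 / 4 * (I[1] - I[0])
        ]
        if not heavy:
            return selected, remaining      # remaining energy is at most energy_bound / 2
        xi, I, S = max(heavy, key=lambda tree: tree[0])
        # S lies in the maximal tree by definition; the union only guards toy input.
        T = (maximal_tree[(xi, I)] & remaining) | S
        selected.append((xi, I, frozenset(T)))
        remaining -= T


# Hypothetical toy data: coeff_sq[P] plays the role of |<f, phi_P>|^2.
tiles = {"a", "b", "c", "d"}
coeff_sq = {"a": 4.0, "b": 1.0, "c": 0.1, "d": 3.0}
overlapping_trees = [
    (5, (0.0, 2.0), frozenset({"a", "b"})),
    (3, (2.0, 4.0), frozenset({"d"})),
    (1, (0.0, 4.0), frozenset({"c"})),
]
maximal_tree = {
    (5, (0.0, 2.0)): frozenset({"a", "b", "c"}),
    (3, (2.0, 4.0)): frozenset({"b", "d"}),
    (1, (0.0, 4.0)): frozenset({"c"}),
}
selected, leftover = select_trees(tiles, coeff_sq, overlapping_trees, maximal_tree, 2.0)
```

The loop terminates because every step removes a nonempty set of multitiles, and choosing the largest top frequency first is what the proofs of (17) and (18) rely on when they compare trees selected at different steps.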



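Proof of (17). It suffices to show

(22) $\frac{\mathcal{E}^2}{|F|} \sum_{j=1}^{n} |I_{S_j}| \lesssim 1$.

Since the $S_j$ satisfy (21), we have

$\Big( \frac{\mathcal{E}^2}{|F|} \sum_{j=1}^{n} |I_{S_j}| \Big)^2 \le 16 |F|^{-2} \Big( \sum_{j=1}^{n} \sum_{P \in S_j} |\langle f, \phi_P \rangle|^2 \Big)^2$.

Now

$\Big( \sum_{j=1}^{n} \sum_{P \in S_j} |\langle f, \phi_P \rangle|^2 \Big)^2 = \Big\langle \sum_{j=1}^{n} \sum_{P \in S_j} \langle f, \phi_P \rangle \phi_P, f \Big\rangle^2 \le \|f\|_2^2\, \Big\| \sum_{j=1}^{n} \sum_{P \in S_j} \langle f, \phi_P \rangle \phi_P \Big\|_2^2 \le |F| \sum_{j=1}^{n} \sum_{P \in S_j} \sum_{k=1}^{n} \sum_{P' \in S_k} |\langle f, \phi_P \rangle|\, |\langle f, \phi_{P'} \rangle|\, |\langle \phi_P, \phi_{P'} \rangle|$

where in the last inequality we used $|f| \le 1_F$. By symmetry, it remains, for (22), to show that

(23) $\sum_{j=1}^{n} \sum_{k=1}^{n} \sum_{P \in S_j} \sum_{P' \in S_k : |I'| = |I|} |\langle f, \phi_P \rangle|\, |\langle f, \phi_{P'} \rangle|\, |\langle \phi_P, \phi_{P'} \rangle| \lesssim \mathcal{E}^2 \sum_{j=1}^{n} |I_{S_j}|$

and

(24) $\sum_{j=1}^{n} \sum_{k=1}^{n} \sum_{P \in S_j} \sum_{P' \in S_k : |I'| < |I|} |\langle f, \phi_P \rangle|\, |\langle f, \phi_{P'} \rangle|\, |\langle \phi_P, \phi_{P'} \rangle| \lesssim \mathcal{E}^2 \sum_{j=1}^{n} |I_{S_j}|$.

In both cases, we will use the estimate

(25) $|\langle \phi_P, \phi_{P'} \rangle| \lesssim \Big( \frac{|I|}{|I'|} \Big)^{1/2} \langle w_I, 1_{I'} \rangle$

which holds whenever $|I'| \le |I|$.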

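Estimating the product of two terms by the square of their maximum, we see that the left side of (23) is

$\le 2 \sum_{j=1}^{n} \sum_{k=1}^{n} \sum_{P \in S_j} \sum_{P' \in S_k : |I'| = |I|} |\langle f, \phi_P \rangle|^2\, |\langle \phi_P, \phi_{P'} \rangle|$.

Recall that $\langle \phi_P, \phi_{P'} \rangle = 0$ unless $C_3\omega_u \cap C_3\omega'_u \ne \emptyset$. Thus, by (16), (25) and the fact that the $S_k$ are pairwise disjoint, we can estimate the last display by

$\lesssim \sum_{j=1}^{n} \sum_{P \in S_j} |\langle f, \phi_P \rangle|^2 \sum_{I' : |I'| = |I|} \langle w_I, 1_{I'} \rangle \lesssim \sum_{j=1}^{n} \sum_{P \in S_j} |\langle f, \phi_P \rangle|^2 \lesssim \mathcal{E}^2 \sum_{j=1}^{n} |I_{S_j}|$.

This finishes the proof of (23).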

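Applying the Cauchy-Schwarz inequality, we see that the left side of (24) is bounded by

$\sum_{j=1}^{n} \Big( \sum_{P \in S_j} |\langle f, \phi_P \rangle|^2 \Big)^{1/2} \Big( \sum_{P \in S_j} \Big( \sum_{k=1}^{n} \sum_{P' \in S_k : |I'| < |I|} |\langle f, \phi_{P'} \rangle|\, |\langle \phi_P, \phi_{P'} \rangle| \Big)^2 \Big)^{1/2}$.

Twice using the fact that the energy of $\mathbf{P}$ is bounded by $\mathcal{E}$, we see that the last display is

$\le \mathcal{E}^2 \sum_{j=1}^{n} |I_{S_j}|^{1/2} \Big( \sum_{P \in S_j} \Big( \sum_{k=1}^{n} \sum_{P' \in S_k : |I'| < |I|} |\langle \phi_P, |I'|^{1/2} \phi_{P'} \rangle| \Big)^2 \Big)^{1/2}$.

Thus, to prove (24) it remains to show that, for each $j$,

(26) $\sum_{P \in S_j} \Big( \sum_{k=1}^{n} \sum_{P' \in S_k : |I'| < |I|} |\langle \phi_P, |I'|^{1/2} \phi_{P'} \rangle| \Big)^2 \lesssim |I_{S_j}|$.

Again, we only have $|\langle \phi_P, |I'|^{1/2} \phi_{P'} \rangle|$ nonzero when $C_3\omega_u \cap C_3\omega'_u \ne \emptyset$, which can only happen if $\sup C_3\omega_u \in C_3\omega'_u$ or $\inf C_3\omega_u \in C_3\omega'_u$. Applying (25), we thus see that the left side of (26) is dominated by a constant times the expression

$\sum_{P \in S_j} |I| \Big( \sum_{k=1}^{n} \sum_{P' \in S_k : |I'| < |I|,\ \sup C_3\omega_u \in C_3\omega'_u} \langle w_I, 1_{I'} \rangle \Big)^2 + \sum_{P \in S_j} |I| \Big( \sum_{k=1}^{n} \sum_{P' \in S_k : |I'| < |I|,\ \inf C_3\omega_u \in C_3\omega'_u} \langle w_I, 1_{I'} \rangle \Big)^2$.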

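We now claim that, for each $P \in S_j$ (with time interval $I$),

(27) $\Big( \sum_{k=1}^{n} \sum_{P' \in S_k : |I'| < |I|,\ \sup C_3\omega_u \in C_3\omega'_u} \langle w_I, 1_{I'} \rangle \Big)^2 \lesssim \langle w_I, 1_{\mathbb{R} \setminus I_{S_j}} \rangle$

and that the same inequality with sup replaced by inf in the $P'$ summation holds as well. To see this consider two multitiles $P^1 = (I^1, \omega^1) \in S_{k_1}$ and $P^2 = (I^2, \omega^2) \in S_{k_2}$, $P^1 \ne P^2$, so that $|I^2| \le |I^1|$ and $C_3\omega_u^1 \cap C_3\omega_u^2 \ne \emptyset$. The last condition implies $k_1 \ne k_2$ (since $S_{k_1}$ and $S_{k_2}$ are $l$-overlapping). The inequality (27) is immediate if we can show that $I^1$ and $I^2$ are disjoint and if in addition $|I^2| < |I^1|$, then $I^2$ does not belong to the top interval of the tree $S_{k_1}$. Now, if $|I^1| = |I^2|$, then from (16) it follows that $\omega_u^1 = \omega_u^2$ and hence, since $P^1 \ne P^2$, we have $I^1 \cap I^2 = \emptyset$. If $|I^2| < |I^1|$, then by (15) $|\omega_u^1|$ is smaller than $|\omega_u^2|$ by at least the factor in (15); since $C_3\omega_u^1 \cap C_3\omega_u^2 \ne \emptyset$ this implies that $\inf C_2\omega_l^1 > \sup C_2\omega_l^2$. As both trees are $l$-overlapping the top frequency of $S_{k_1}$ belongs to $C_2\omega_l^1$ and is above the top frequency of $S_{k_2}$ which belongs to $C_2\omega_l^2$. Thus by the maximality condition on the top frequency in the selection process of the trees we see that the tree $S_{k_1}$ was selected before the tree $S_{k_2}$, i.e. $k_1 < k_2$. This implies that $P^2$ does not belong to the tree $T_{k_1}$, and since the interval $\omega_{S_{k_1}}$ is contained in the convex hull of $C_2\omega_u^2$ and $C_2\omega_l^2$ we see that the time intervals $I^2$ and $I_{T_{k_1}} = I_{S_{k_1}}$ cannot intersect. Thus $I^1 \cap I^2 = \emptyset$ as $I^1 \subset I_{T_{k_1}}$. This concludes the argument for (27).

Now by the disjointness condition we see that indeed the left hand side of (26) is bounded by a constant times

$\sum_{P \in S_j} |I|\, \langle w_I, 1_{\mathbb{R} \setminus I_{S_j}} \rangle = \sum_{i : 2^i \le |I_{S_j}|} 2^i \sum_{P \in S_j : |I| = 2^i} \langle w_I, 1_{\mathbb{R} \setminus I_{S_j}} \rangle \lesssim |I_{S_j}| \sup_i \sum_{P \in S_j : |I| = 2^i} \langle w_I, 1_{\mathbb{R} \setminus I_{S_j}} \rangle$.

One may check that, for each $i$,

$\sum_{P \in S_j : |I| = 2^i} \langle w_I, 1_{\mathbb{R} \setminus I_{S_j}} \rangle \lesssim 1$

and (26) follows. We have already seen that (26) implies (24); this completes the proof of (17).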

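Proof of (18). We need to show that for each dyadic interval $J$, we have

$\frac{1}{|J|} \int_J \Big| \sum_{T \in \mathbf{T}} 1_{2^\ell I_T}(x) - \frac{1}{|J|} \int_J \sum_{T \in \mathbf{T}} 1_{2^\ell I_T}(y)\,dy \Big|\,dx \lesssim 2^{2\ell} \mathcal{E}^{-2}$.

This is an immediate consequence of

(28) $\sum_{T \in \tilde{\mathbf{T}}} |I_T| \lesssim \mathcal{E}^{-2} 2^{\ell} |J|$.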

Let 

T = {T G T : It C 2^+^ J, \It\ < \ J\} 
and note that if T G T with 2^/t n J / 0, J then T G T. Write f = f + f" where 
$|f'| \le 1_{F \cap 2^{\ell+5}J}$ and $|f''| \le 1_{F \cap (\mathbb{R} \setminus 2^{\ell+5}J)}$.



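We will write $\tilde{\mathbf{T}}$ as the union of collections of trees $\mathbf{T}^{\mathrm{main}} \cup \mathbf{T}^0 \cup \mathbf{T}^1 \cup \dots$ each of which will have certain properties related to the energy. For each tree $T \in \tilde{\mathbf{T}}$ there is an $l$-overlapping tree $S = S(T)$ chosen in the algorithm above with $I_S = I_T$ and

(29) $\frac{1}{|I_S|} \sum_{P \in S} |\langle f, \phi_P \rangle|^2 > \mathcal{E}^2/4$.

Let

$\mathbf{T}^0 = \Big\{ T \in \tilde{\mathbf{T}} : \frac{1}{|I_S|} \sum_{P \in S(T)} |\langle f'', \phi_P \rangle|^2 > \mathcal{E}^2/16 \Big\}$.

For $j \ge 1$, define

$\mathbf{T}^j = \Big\{ T \in \tilde{\mathbf{T}} : \sup_{S' \subset S(T),\ |I_{S'}| = 2^{-j}|I_T|} \frac{1}{|I_{S'}|} \sum_{P \in S'} |\langle f'', \phi_P \rangle|^2 > \mathcal{E}^2/16 \Big\}$

where, for each $T$, the sup above is taken over all $l$-overlapping trees $S'$ with $S' \subset S(T)$. Finally, let

$\mathbf{T}^{\mathrm{main}} = \tilde{\mathbf{T}} \setminus (\mathbf{T}^0 \cup \mathbf{T}^1 \cup \dots)$.

We split the sum (28) into the "main" term involving trees in $\mathbf{T}^{\mathrm{main}}$ and an error term involving $\bigcup_{j \ge 0} \mathbf{T}^j$.

We first consider the main term. It is our objective to prove

(30) $\sum_{T \in \mathbf{T}^{\mathrm{main}}} |I_T| \lesssim \mathcal{E}^{-2} 2^{\ell} |J|$.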

Let T G x™'^™ and let S' be any /-overlapping tree contained in S satisfying < 
Since the energy of P is bounded by £ and since T is not in any T-^, we have 

I A I pgg, I i I p^g, I i I p^g, 

From (29) and the fact that $T\notin\mathbf T^{0}$, we have

$\frac1{|I_S|}\sum_{P\in S}|\langle f',\phi_P\rangle|^2\ge\mathcal E^2/8-\mathcal E^2/16=\mathcal E^2/16.$

This inequality allows us to essentially repeat the above proof of (17). The $l$-overlapping
trees $S(T)$ form a (finite or infinite) subsequence of the sequence $S_j$ which we denote by
$S_{j(\nu)}$ so that we have $j(\nu)>j(\nu')$ for $\nu>\nu'$. We need to prove the analogue of (22)

which is

^"^^^ \p pi 2^+5 J| ^ I'^'^jX'') I ^ ^ 

and as before we are aiming to estimate the square of the expression on the left hand 
side by the expression itself. The Sj(^^^ satisfy 



and therefore 

^2 



^T^EI^-.J) < 256|Fn2^-V|-(E E K/'^-)l 

We continue to argue with exactly the same reasoning as in the proof of (22), replacing
$f$ with $f'$ and $F$ with $F\cap2^{\ell+5}J$. This leads to the proof of (31) and thus to

$\sum_{T\in\mathbf T^{\rm main}}|I_T|\lesssim\mathcal E^{-2}|F\cap2^{\ell+5}J|$

which is clearly $\lesssim\mathcal E^{-2}2^{\ell}|J|$. Thus (30) is established.

For the complementary terms we prove better estimates, namely, for $j=0,1,2,\dots$,
(32) E \It\<2-'-^£-V\- 
For each $T\in\mathbf T^{j}$ we have $|I|\le2^{-j}|I_T|\le2^{-j}|J|$ for all $P\in S(T)$ and thus

TeTJ' TgTJ Pg5(T):|/|<2-J|J| 

Since the S are pairwise disjoint, the right hand side is 

k>j P:|/|=2-''VI 

Fixing $k\ge j$, we apply Minkowski's inequality to obtain

$\sum_{P:\,|I|=2^{-k}|J|}|\langle f'',\phi_P\rangle|^2\le\Big(\sum_{K:\,|K|=2^{-k}|J|}\Big(\sum_{P:\,|I|=2^{-k}|J|}|\langle1_Kf'',\phi_P\rangle|^2\Big)^{1/2}\Big)^2$

where above, we sum over dyadic intervals $K$ and use the fact that $f''$ is supported on
$\mathbb R\setminus2^{\ell+5}J$.

Now note that $\phi_{P'}=c\exp(2\pi i(c(\omega_u')-c(\omega_u))\,\cdot\,)\,\phi_P$ when $I=I'$. Thus if $\mathbf P(I_0)$ is a
collection of disjoint multitiles with common time interval $I_0$, if $g$ is supported on $CI_0$
and if $P_{I_0}$ is any fixed multitile in $\mathbf P(I_0)$ then Bessel's inequality gives

$\sum_{P\in\mathbf P(I_0)}|\langle g,\phi_P\rangle|^2\lesssim|I_0|\int|g\,\phi_{P_{I_0}}|^2\,dx.$

We apply this observation to the inner sums in the previous display and obtain the
inequality

(33) $\sum_{P:\,|I|=2^{-k}|J|,\ I\subset2^{\ell+1}J}|\langle f'',\phi_P\rangle|^2\lesssim2^{-k}|J|\Big(\sum_{K:\,|K|=2^{-k}|J|,\ K\cap2^{\ell+5}J=\emptyset}\Big(\sum_{I:\,|I|=2^{-k}|J|,\ I\subset2^{\ell+1}J}\|1_Kf''\phi_{P_I}\|_{L^2}^2\Big)^{1/2}\Big)^2$

where for each $I$, $P_I$ is any multitile with time interval $I$. Since $|f''|\le1$ the bound (14)
yields

$\|1_Kf''\phi_{P_I}\|_{L^2}^2\lesssim\Big(1+\frac{{\rm dist}(K,I)}{|I|}\Big)^{-N}.$

Applying it with large $N$ we see that the right hand side of (33) is $\lesssim2^{-(k+\ell)(N-3)}2^{-k}|J|$.
Summing over $k\ge j$ we obtain inequality (32). This concludes the proof of (18).

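Proof of (20). For each $T\in\mathbf T$, let $S=S(T)$ be the corresponding $l$-overlapping tree
from the selection algorithm above and recall

$\sum_{T\in\mathbf T}|I_T|(\mathcal E/2)^2\le\sum_{P\in\cup_{T\in\mathbf T}S(T)}|\langle f,\phi_P\rangle|^2.$

Since $\mathbf P=\cup_{T'\in\mathbf T'}T'$ the right side above is dominated by

(34) $\sum_{T'\in\mathbf T'}\ \sum_{P\in T'\cap\cup_{T\in\mathbf T}S(T):\ \xi_{T'}\in C_2\omega_l}|\langle f,\phi_P\rangle|^2+\sum_{T'\in\mathbf T'}\ \sum_{P\in T'\cap\cup_{T\in\mathbf T}S(T):\ \xi_{T'}\ge\inf C_3\omega_u}|\langle f,\phi_P\rangle|^2$

$\qquad+\sum_{T'\in\mathbf T'}\ \sum_{P\in T'\cap\cup_{T\in\mathbf T}S(T):\ \sup C_2\omega_l\le\xi_{T'}<\inf C_3\omega_u}|\langle f,\phi_P\rangle|^2.$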
For each $T'\in\mathbf T'$ the set of tiles $P\in T'\cap\cup_{T\in\mathbf T}S(T)$ with the property that $\xi_{T'}\in C_2\omega_l$ is
by definition an $l$-overlapping tree. Thus, since $\mathbf P$ has energy bounded by $\mathcal E$, we estimate
the first term in (34)

$\sum_{T'\in\mathbf T'}\ \sum_{P\in T'\cap\cup_{T\in\mathbf T}S(T):\ \xi_{T'}\in C_2\omega_l}|\langle f,\phi_P\rangle|^2\le\sum_{T'\in\mathbf T'}\mathcal E^2|I_{T'}|.$

For the estimation of the second term in (34) we observe that any fixed multitile forms
an $l$-overlapping tree (with respect to some top data), and we use the energy bound for
fixed tiles. We observe that the rectangles $\{I\times[\inf C_3\omega_u,\sup C_2\omega_u):P\in\cup_{T\in\mathbf T}S(T)\}$
are pairwise disjoint. Indeed if $P\in S$, $P'\in S$ this follows since $S$ is $l$-overlapping
(cf. Observation 4.1). If $P\in S_{\kappa_1}$ and $P'\in S_{\kappa_2}$, $\kappa_1<\kappa_2$, then $\xi_{S_{\kappa_1}}>\xi_{S_{\kappa_2}}$ and an
overlap of $[\inf C_3\omega_u,\sup C_2\omega_u)$ and $[\inf C_3\omega_u',\sup C_2\omega_u')$ would imply that $P'$ belongs to
the maximal tree with the same top data as $S_{\kappa_1}$, i.e. this would imply that $P'\in T_{\kappa_1}$,
which is disjoint from $S_{\kappa_2}$.

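This allows us to estimate for any fixed $T'\in\mathbf T'$

$\sum_{P\in T'\cap\cup_{T\in\mathbf T}S(T):\ \xi_{T'}\ge\inf C_3\omega_u}|\langle f,\phi_P\rangle|^2\le\sum_{P\in T'\cap\cup_{T\in\mathbf T}S(T):\ \xi_{T'}\ge\inf C_3\omega_u}\mathcal E^2|I|\le\mathcal E^2|I_{T'}|.$

Now sum over $T'\in\mathbf T'$ and it follows that the middle term in (34) is $\le\sum_{T'\in\mathbf T'}\mathcal E^2|I_{T'}|$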
and summing 



For the estimation of the third term in (34) we begin with a preliminary observation,
also related to the selection of the 8. Suppose P£T'ri8,P£T'ri8 where T,T £ T 
and 

Ct' e [sup C2UJ1, inf Csuju) n [sup (720;;, inf Csa)^), 
and suppose I C I and P ^ P. From (I16p we have I I- We also have inf C2tD/ < 
sup C2i^i since otherwise it would follow that 8 was selected prior to 8 and hence P £ T 
which is impossible. From (jlSp . we have inf C2ti;; > supCaW; and so P is in the maximal 
/-overlapping tree contained in T' with top data (/, inf C2OJ1). 

For each T' £ T' let T" be the collection of multitiles P £ T' r\ (JreT ^ with ^t' G 
[sup (720;; , inf Cs^Ju) and / maximal among such multitiles. Then 

E E \u,<^p)\'< EE E K/'^-)i' 

T'£T' PeT'nUTex-^ T'GT' P"£T" PGr'nU^gT 

sup C2ti^i <5t' <inf C:iLJu sup 02^1^; <5t' <inf Csuiu 

Id" 

Considering the discussion in the preceding paragraph, we may apply the energy bound 
to the maximal /-overlapping tree contained in T' with top data (/, inf C2UI1) ■ Since the 
$I''$ are disjoint subintervals of $I_{T'}$ we see that the right side in the last display is bounded
by 

E E < E 2^^Vt'|. 

This completes the proof of (20). □



The proposition below is for use in tandem with Proposition 4.3.

Proposition 4.4. Let $\mathbf P$ be a collection of multitiles and $\mu>0$. Then, there is a
collection of trees $\mathbf T$ such that

(35) $\sum_{T\in\mathbf T}|I_T|\lesssim\mu^{-r'}|E|$

and such that

${\rm density}(\mathbf P\setminus\cup_{T\in\mathbf T}T)\le\mu/2.$

Proof. We select trees through an iterative procedure. Suppose that trees $T_j$, $T_j^+$, $T_j^-$
have been chosen for $j=1,\dots,k$. Let

$\mathbf P_k=\mathbf P\setminus\bigcup_{j=1}^{k}(T_j\cup T_j^+\cup T_j^-).$

If ${\rm density}(\mathbf P_k)\le\mu/2$ then we terminate the procedure and set

$\mathbf T=\{T_1,T_1^+,T_1^-,\dots,T_k,T_k^+,T_k^-\}.$

Otherwise, we may find a nonempty tree $T\subset\mathbf P_k$ such that

(36) ^ X (1 + ^T^)~' E i«'^(^)r' > 

Choose $T_{k+1}\subset\mathbf P_k$ so that $|I_{T_{k+1}}|$ is maximal among all nonempty trees contained
in $\mathbf P_k$ which satisfy (36), and so that $T_{k+1}$ is the maximal, with respect to inclusion,
tree contained in $\mathbf P_k$ with top data $(I_{T_{k+1}},\xi_{T_{k+1}})$. Let $T_{k+1}^+\subset\mathbf P_k$ be the maximal tree
contained in $\mathbf P_k$ with top data $(I_{T_{k+1}},\xi_{T_{k+1}}+(C_2-1)/(2|I_{T_{k+1}}|))$ and $T_{k+1}^-\subset\mathbf P_k$ be the
maximal tree contained in $\mathbf P_k$ with top data $(I_{T_{k+1}},\xi_{T_{k+1}}-(C_2-1)/(2|I_{T_{k+1}}|))$. Since
each $T_j$ is nonempty and $\mathbf P$ is finite, this process will eventually stop.

To prove (35), it will suffice to verify

(37) $\sum_j|I_{T_j}|\lesssim\mu^{-r'}|E|.$

To this end, we first observe that the tiles $I_{T_j}\times\omega_{T_j}$ are pairwise disjoint. Indeed, suppose
that $(I_{T_j}\times\omega_{T_j})\cap(I_{T_{j'}}\times\omega_{T_{j'}})\neq\emptyset$ and $j<j'$. Then, by the first maximality condition,
we have $|I_{T_j}|\ge|I_{T_{j'}}|$ and so $I_{T_{j'}}\subset I_{T_j}$ and $|\omega_{T_j}|\le|\omega_{T_{j'}}|$. From the latter inequality,
it follows that for every $P\in T_{j'}$, either $\omega_{T_j}\subset\omega_m$, $\omega_{T_j^+}\subset\omega_m$, or $\omega_{T_j^-}\subset\omega_m$. Thus,

$T_{j'}\subset T_j\cup T_j^+\cup T_j^-$ which contradicts the selection algorithm.

Breaking the integral up into pieces and applying a pigeonhole argument, it follows
from (36) that for each $j$ there is a positive integer $\ell_j$ such that

(38) $|I_{T_j}|\le C2^{-3\ell_j}\mu^{-r'}\int_{E\cap2^{\ell_j}I_{T_j}}\sum_{k:\,\xi_{k-1}(x)\in\omega_{T_j}}|a_k(x)|^{r'}\,dx.$

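For each $\ell$ we let $\mathbf T^{(\ell)}=\{T_j:\ell_j=\ell\}$ and choose elements of $\mathbf T^{(\ell)}$: $T_1^{(\ell)},T_2^{(\ell)},\dots$
and subsets of $\mathbf T^{(\ell)}$: $\mathbf T_1^{(\ell)},\mathbf T_2^{(\ell)},\dots$ as follows. Suppose $T_j^{(\ell)}$ and $\mathbf T_j^{(\ell)}$ have been chosen
for $j=1,\dots,k$. If $\mathbf T^{(\ell)}\setminus\bigcup_{j=1}^{k}\mathbf T_j^{(\ell)}$ is empty, then terminate the selection procedure.
Otherwise, let $T_{k+1}^{(\ell)}$ be an element of $\mathbf T^{(\ell)}\setminus\bigcup_{j=1}^{k}\mathbf T_j^{(\ell)}$ with $|I_{T_{k+1}^{(\ell)}}|$ maximal, and let

$\mathbf T_{k+1}^{(\ell)}=\{T\in\mathbf T^{(\ell)}\setminus\bigcup_{j=1}^{k}\mathbf T_j^{(\ell)}:\ (2^{\ell}I_T\times\omega_T)\cap(2^{\ell}I_{T_{k+1}^{(\ell)}}\times\omega_{T_{k+1}^{(\ell)}})\neq\emptyset\}.$

By construction, $\mathbf T^{(\ell)}=\bigcup_j\mathbf T_j^{(\ell)}$ and so

(39) $\sum_{T\in\mathbf T^{(\ell)}}|I_T|\le\sum_j\sum_{T\in\mathbf T_j^{(\ell)}}|I_T|.$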
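The selection just described is a greedy, Vitali-type grouping. The following Python sketch illustrates only its combinatorial skeleton under a simplifying assumption that is ours, not the paper's: the frequency intervals $\omega_T$ are ignored, so each tree is represented by its time interval alone, stored as a hypothetical pair `(left, length)`. The names `dilate` and `greedy_groups` are likewise illustrative.

```python
def dilate(interval, factor):
    """Return the endpoints of the interval with the same center and
    `factor` times the length; `interval` is a pair (left, length)."""
    left, length = interval
    center = left + length / 2
    return (center - factor * length / 2, center + factor * length / 2)


def greedy_groups(intervals, ell):
    """Greedy grouping: repeatedly pick a remaining interval of maximal
    length (the leader) and collect every remaining interval whose
    2**ell-dilate meets the leader's 2**ell-dilate.

    Simplification (assumption): only time intervals are compared; the
    frequency components used in the paper are dropped.
    """
    factor = 2 ** ell
    remaining = list(intervals)
    groups = []
    while remaining:
        leader = max(remaining, key=lambda iv: iv[1])
        a, b = dilate(leader, factor)
        group = []
        for iv in remaining:
            c, d = dilate(iv, factor)
            if c < b and a < d:  # half-open intervals [a, b) and [c, d) meet
                group.append(iv)
        groups.append((leader, group))
        remaining = [iv for iv in remaining if iv not in group]
    return groups
```

Every interval lands in exactly one group and no member of a group is longer than its leader, which is the property used (twice) in the estimate that follows.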

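Using the fact that the tiles $I_{T_j}\times\omega_{T_j}$ are pairwise disjoint, and (twice) the fact that
$|I_T|\le|I_{T_j^{(\ell)}}|$ for every $T\in\mathbf T_j^{(\ell)}$, we see that for each $j$

$\sum_{T\in\mathbf T_j^{(\ell)}}|I_T|\le C2^{\ell}|I_{T_j^{(\ell)}}|.$

From (38), we thus see that the right side of (39) is dominated by a constant times

$2^{-2\ell}\mu^{-r'}\sum_j\int_{E\cap2^{\ell}I_{T_j^{(\ell)}}}\ \sum_{k:\,\xi_{k-1}(x)\in\omega_{T_j^{(\ell)}}}|a_k(x)|^{r'}\,dx\le2^{-2\ell}\mu^{-r'}|E|$

where we used the disjointness of the rectangles $2^{\ell}I_{T_j^{(\ell)}}\times\omega_{T_j^{(\ell)}}$, and $\sum_k|a_k(x)|^{r'}\le1$.

Summing over $\ell$ we obtain (37). □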



5. The tree estimate 



In this section we prove the basic estimate for the model operators in the special case
where the collection of multitiles is a tree. In what follows we use the notation $V^rAf(x)$
for the $r$-variation of $k\mapsto A_kf(x)$, for a given family of operators $A_k$ indexed by $k\in\mathbb N$.
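To make the notation concrete: for a finite sequence the $r$-variation can be computed exactly by dynamic programming over the last index of the partition. The sketch below is only an illustration; the function name `r_variation` and the sample sequences are our own choices and do not come from the paper.

```python
def r_variation(seq, r):
    """Return the r-variation of a finite real or complex sequence:
    the sup over increasing indices n_0 < ... < n_K of
    (sum_i |seq[n_{i+1}] - seq[n_i]|**r) ** (1/r)."""
    if r <= 0:
        raise ValueError("r must be positive")
    # best[j] = largest value of sum |jump|**r over index chains ending at j
    best = [0.0] * len(seq)
    for j in range(1, len(seq)):
        best[j] = max(best[i] + abs(seq[j] - seq[i]) ** r for i in range(j))
    return max(best, default=0.0) ** (1.0 / r)


if __name__ == "__main__":
    oscillating = [0, 1, 0, 1]
    # The value decreases as r grows, in line with the monotonicity in r.
    print(r_variation(oscillating, 2), r_variation(oscillating, 4))
```

The quadratic running time is fine for short sequences; it is meant to clarify the definition, not to be efficient.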

An essential tool introduced to harmonic analysis by Bourgain [1] is Lépingle's
inequality for martingales ([22]). Consider the martingale of dyadic averages

$\mathbb E_k[f](x)=\frac1{|I_k(x)|}\int_{I_k(x)}f(y)\,dy$

where $I_k(x)$ is the dyadic interval of length $2^k$ containing $x$. It is a special case of
Lépingle's inequality that

(40) $\|V^r\mathbb E[f]\|_{L^p}\le C_{p,r}\|f\|_{L^p}$

whenever $1<p<\infty$ and $r>2$. Simple proofs (based on jump inequalities) have
been obtained in [1] and [27] (see also [9], [15] for other expositions). Inequality (40)
has been extended to various families of convolution operators ([1], [11], [2], [H]). Let
$\psi$ be a Schwartz function on $\mathbb R$ with $\int\psi=1$, for each $k$ let $\psi_k=2^{-k}\psi(2^{-k}\cdot)$ and let
$A_kf(x)=\psi_k*f$. Then

(41) $\|V^rAf\|_{L^p}\le C_{p,r}\|f\|_{L^p}$, $1<p<\infty$, $r>2$,

follows from (40); the essential tool is the square function estimate

(42) $\Big\|\Big(\sum_{k=-\infty}^{\infty}|A_kf-\mathbb E_k[f]|^2\Big)^{1/2}\Big\|_{L^p}\le C_p\|f\|_{L^p}$, $1<p<\infty$.

For a proof we refer to [13] or [15].
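As a numerical illustration of the objects entering Lépingle's inequality, the following Python sketch computes dyadic averages of a sampled function and the $r$-variation of $k\mapsto\mathbb E_k[f](x)$ at each sample point. It is a discrete toy model under our own assumptions: the function is given by a list of $2^n$ samples, only the scales $1,2,\dots,2^n$ (in units of samples) are used, and all names are hypothetical.

```python
def dyadic_averages(samples):
    """Return levels[k][i]: the average of `samples` over the dyadic block
    of 2**k consecutive samples containing index i, for k = 0..log2(n)."""
    n = len(samples)
    if n == 0 or n & (n - 1):
        raise ValueError("number of samples must be a power of two")
    levels = []
    block = 1
    while block <= n:
        level = []
        for start in range(0, n, block):
            avg = sum(samples[start:start + block]) / block
            level.extend([avg] * block)
        levels.append(level)
        block *= 2
    return levels


def r_variation(seq, r):
    """r-variation of a finite sequence, by dynamic programming."""
    best = [0.0] * len(seq)
    for j in range(1, len(seq)):
        best[j] = max(best[i] + abs(seq[j] - seq[i]) ** r for i in range(j))
    return max(best, default=0.0) ** (1.0 / r)


def variation_of_dyadic_averages(samples, r):
    """For each sample index i, the r-variation in k of the dyadic averages."""
    levels = dyadic_averages(samples)
    return [r_variation([level[i] for level in levels], r)
            for i in range(len(samples))]
```

Comparing the $\ell^p$ norm of the output with that of `samples` for various inputs gives a feel for (40); the sketch makes no claim about the constant $C_{p,r}$.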



We shall use (41) to prove the following estimate in terms of energy and density. The
bound will be applied in Section 7 with $q=r'$ and $q=1$.

Proposition 5.1. Let $\mathcal E>0$, $0<\mu<3$ and let $T$ be a tree with energy bounded above
by $\mathcal E$ and density bounded above by $\mu$. Then, for each $1\le q\le2$

(43) $\Big\|\sum_{P\in T}\langle f,\phi_P\rangle\phi_Pa_P1_E\Big\|_{L^q}\lesssim\mathcal E\mu^{\min(1,r'/q)}|I_T|^{1/q}.$

Furthermore, for $\ell\ge0$ we have

(44) $\Big\|\sum_{P\in T}\langle f,\phi_P\rangle\phi_Pa_P1_E\Big\|_{L^q(\mathbb R\setminus2^{\ell}I_T)}\lesssim2^{-\ell(N-10)}\mathcal E\mu^{\min(1,r'/q)}|I_T|^{1/q}.$

We remark that the bounds above also hold for $2<q<\infty$, but the result for this
range of exponents is not needed for our purposes; it requires an additional estimate
for $\sum_{P\in T}\langle f,\phi_P\rangle\phi_P$.

Proof. We begin with some preliminary reductions. Every tree can be split into an
$l$-lacunary tree and an $l$-overlapping tree and it suffices to prove the asserted estimate for
these cases.

If the tree is $l$-overlapping we introduce further decompositions. By breaking up $T$
into a bounded number of subtrees we may and shall assume without loss of generality
that for each $P\in T$,

(45) $\xi_T\in\omega_l+j|\omega_l|$ for some integer $j$ with $|j|\le C_2$.

Moreover we shall assume, for every $l$-overlapping tree $T$, that either $\xi_T<\inf\omega_l$ for
every $P\in T$ (in which case we refer to $T$ as $l^-$-overlapping) or $\xi_T\ge\inf\omega_l$ for every
$P\in T$ (in which case we refer to $T$ as $l^+$-overlapping). Every $l$-overlapping tree can
be split into an $l^-$-overlapping and an $l^+$-overlapping tree. For the remainder of the
proof, we assume without loss of generality that $T$ is either $l$-lacunary or $l^+$-overlapping
or $l^-$-overlapping, and that for the last two categories property (45) is satisfied.

Let $\mathbf J$ be the collection of dyadic intervals $J$ which are maximal with respect to the
property that $I\not\subset3J$ for every $P\in T$.



Our first goal is to prove that for each J € J 



PeT:\I\<C"\J\ 
where $C''>1$ is a constant to be determined later; we shall see that $C''=8C_1(C_2-1)^{-1}$
is an admissible choice.

By Hölder's inequality, we may assume that $q\ge r'$. Fix $P\in T$ with $|I|<C''|J|$.
From the energy bound, we have

(47) $\|\langle f,\phi_P\rangle\phi_Pa_P1_E\|_{L^q(J)}\lesssim\mathcal E\Big(1+\frac{{\rm dist}(I,J)}{|I|}\Big)^{-N}\|a_P1_E\|_{L^q(J)}.$

From the density bound applied to $\approx1/(C_2-1)$ nonempty trees, each with top time
interval $I$, we obtain

Since $I\not\subset3J$, it follows that $1+|x-y|/|I|\le C(1+{\rm dist}(I,J)/|I|)$ for every $x\in J$ and
$y\in I$. Thus

l|apli.|lL(.) < WaplEW^.^j^ < (l + ^)Vl/ 
where, above, we use the fact that $|a_P|\le1$. Thus we can replace (47) by

||(/,^p)^papl^||,,(,)<f//^|/|Vni + ^)-(^-^\ 
Summing this estimate and using the fact that T is a tree, we have 

y: {fMct^papiE ^^^^^ <^://''(2-vi)^/ni+^nS^)"^''"'^ 



PeT:|/|=2-fc| J| 

and summing over $k$ gives (46).



We now use (46) to prove (44) for $\ell\ge4$. Indeed, using the maximality of each $J$, we
see that if $\ell\ge4$ and $J\cap(\mathbb R\setminus2^{\ell}I_T)\neq\emptyset$ then ${\rm dist}(I_T,J)\ge|J|/2$ and $|J|\ge2^{\ell-3}|I_T|$. It
thus follows from (46) that



II Y^fAp)<t^paplE , ^ < (|/T|/|J|)(dist(/T, J)/|J|)"2^:;,--(iy/.)|J^|l/52-nA^-10) 

" PeT ^'^-^^ 
whenever $J\cap(\mathbb R\setminus2^{\ell}I_T)\neq\emptyset$. Summing over all $J$, we thus obtain (44) for $\ell\ge4$.



It remains to prove 



PeT ^ ' 

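which, by (46), follows from

(48) $\Big\|\sum_{P\in T:\,|I|\ge C''|J|}\langle f,\phi_P\rangle\phi_Pa_P1_E\Big\|_{L^q(J)}\lesssim\mathcal E\mu^{\min(1,r'/q)}|J|^{1/q}.$

Again, by Hölder's inequality, we may assume that $q\ge r'$. Let

$\Omega_J=\bigcup_{P\in T:\,|I|\ge C''|J|}\omega_l.$

The first step in the proof of (48) will be to demonstrate

(49) $\int_{J\cap E}\ \sum_{k:\,\xi_{k-1}(x)\in\Omega_J}|a_k(x)|^{r'}\,dx\lesssim\mu^{r'}|J|.$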



By the maximality of $J$ there is a multitile $P^*=(I^*,\omega^*)\in T$ with $I^*\subset3\tilde J$ where $\tilde J$ is the
dyadic double of $J$. This implies that there is a dyadic interval $J'$ with $|J|\le|J'|\le4|J|$
and ${\rm dist}(J,J')\le|J|$ and $I^*\subset J'$. We wish to apply the density bound assumption to
a tree consisting of the one multitile $P^*$, with top interval $J'$ and suitable choice of the
top frequency. We distinguish the cases that $T$ is $l^+$-overlapping, $l^-$-overlapping, or
$l$-lacunary.

If T is /"^"-overlapping then T' = J') is a tree. For every P £ T we have 

uji C [^T-^,^T + ^) and thus, with \I\ > C"\J\, uji C [Ct - 2#pT> + 2#Pt)- Thus 
with the choice of C" > -^^^ we have Oj C ujt'- 

If T is /--overlapping then T' = {{P*},^t + J') 

is a tree. Using that T is / - 
overlapping, we see that uji C [S,t,^t + 2C'^ \ J\ ) every P & T with |/| > C"|J|. Thus, 
by choosing C" > we have fij C wt'- 

If T is /-lacunary then T' = {{P*},^t- J') is a tree. Using that T is /-lacunary, 
we see that uji C [£,t — Ic^^j^t) for every P £ T with |/| > C"\J\. Thus, by choosing 
C" > 8j=r^ as in the first case we have j C lo^'- 

In any of the three cases, the density bound gives 



\J' 

and hence (49).



fc:Cfc_i(a;)ei^j,/ 



We now show that if $T$ is $l$-lacunary then (48) follows from (49). We start by observing
that for each $x$ there is at most one integer $m$ and at most one integer $k$ such that there
exists a $P\in T$ with $|I|=2^m$, $\xi_{k-1}(x)\in\omega_l$, and $\xi_k(x)\in\omega_h$. Indeed, suppose such

a $P$ exists, and $P'\in T$ with $|I'|\ge|I|$. If $|I|=|I'|$ the uniqueness of $k$ is obvious.
Suppose \I'\ > Since T is /-lacunary, we have inf(ti;;) > sup(w/) by p^ . and so 
< inf(a;j'). We also have ^k{x) > mf{ujh) > sup(ci;t) > sup(a;;') since C2UJu^^h = 
and C20j'i. It follows that there is no k' with ^y_i{x) G uj[. 

Now let $a(x)=a_k(x)$ if there exists an $m(x)$ as in the previous paragraph with
$2^{m(x)}\ge C''|J|$, and $a(x)=0$ otherwise. We then have

$\Big\|\sum_{P\in T:\,|I|\ge C''|J|}\langle f,\phi_P\rangle\phi_Pa_P1_E\Big\|_{L^q(J)}^q\le\int_{J\cap E}\Big(|a(x)|\sum_{P\in T:\,|I|=2^{m(x)}}|\langle f,\phi_P\rangle\phi_P(x)|\Big)^q\,dx.$

From the energy bound (applied to multitiles) and the bound (14) for $|\phi_P|$, the right
side is bounded by



\x~c{I)\\-N' 



1 

dx. 



[ [\a{x)\ S\I\m{x)y dx < [ [\a{x)\ £: (l + ^ 

Noting that $\sum_{P\in T:\,|I|=2^{m(x)}}(1+|x-c(I)|/|I|)^{-N}\le C$, we see that the last display is



<S^ ! \a{xWdx<£'^ [ ( Yl 



dx, 
by our choice of $a(x)$. Using (49), $q\ge r'$, and the fact that $\sum_k|a_k(x)|^{r'}\le1$, the right
hand side is $\lesssim\mathcal E^q\mu^{r'}|J|$. We may now sum over those $J\in\mathbf J$ which satisfy $J\cap16I_T\neq\emptyset$
and (48) follows for $l$-lacunary trees.



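It remains to prove (48) for the case when $T$ is $l$-overlapping and satisfies condition
(45). For each $J\in\mathbf J$, and each $x\in J\cap E$ we have by Hölder's inequality

(50) $\Big|\sum_{P\in T:\,|I|\ge C''|J|}\langle f,\phi_P\rangle\phi_P(x)a_P(x)\Big|=\Big|\sum_ka_k(x)\sum_{P\in T:\,|I|\ge C''|J|,\ \xi_{k-1}(x)\in\omega_l,\ \xi_k(x)\in\omega_h}\langle f,\phi_P\rangle\phi_P(x)\Big|$

$\qquad\le\Big(\sum_{k:\,\xi_{k-1}(x)\in\Omega_J}|a_k(x)|^{r'}\Big)^{1/r'}\Big(\sum_{k:\,\xi_{k-1}(x)\in\Omega_J}\Big|\sum_{P\in T:\,|I|\ge C''|J|,\ \xi_{k-1}(x)\in\omega_l,\ \xi_k(x)\in\omega_h}\langle f,\phi_P\rangle\phi_P(x)\Big|^r\Big)^{1/r}.$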



Now let $\psi$ be a Schwartz function with $\hat\psi(\xi)=1$ for $|\xi|\le C_1+C_3$ and $\hat\psi(\xi)=0$ for
$|\xi|\ge2C_1$. Define $\psi_\ell=2^{-\ell}\psi(2^{-\ell}\cdot)$, and set



erix) 



We will show that for any $x$ and any $k$ with $\xi_{k-1}(x)<\xi_k(x)$, there exist integers $\ell_1,\ell_2$
depending on $x$ and $k$, such that $2^{\ell_1}\ge|J|$ and



(51) 



E 

PeT:\I\>C"\J\, 
Cfc_i(a;)Gi^i,Cfe(a;)ei^h 



PeT 



|/| for some multifile P, 

bp)(j)p{x). 



From ()15p we have, for each ^ such that 2 

PeT PgT:|/|>2« 

Thus, to prove (51) it will suffice to show that there exist integers $\ell_1$ and $\ell_2$ such that

(52) {P G T : |/| > C"\J\,ik-i{x) G w/,6(^) G ^4 = {P G T : 2^^ < |/| < 2^^. 

Again using (|15p . we see that for P,P'£T with |/| < |/'| we have info;^ < iniuh, and if 
we are in the setting of p-multitiles where p is a 1-index, we have the stronger inequality 
supcj^ < inf Wft. Thus, (|52p will follow after finding ii and £2 with 

(53) {P G T : efc-i(x) G ui} = {P € T : 2^^ < |/| < 2^^. 

Here we use assumption (45). The displayed equation follows when $|j|>1$ from the fact
that $\omega_l\cap\omega_l'=\emptyset$ if $P,P'\in T$ and $|I|<|I'|$; it follows when $j=0$ from the fact that the
intervals $\{\omega_l:P\in T\}$ are nested. Finally, when $j=\pm1$ it follows from the property
that if $P,P',P''\in T$, $|I|<|I'|<|I''|$ and $\omega_l\cap\omega_l''\neq\emptyset$ then $\omega_l''\subset\omega_l'\subset\omega_l$. Thus we have
established (53) and consequently (52) and (51).



Using dsn, we have 



E 



E 



(/, 4>p)4>p{x] 



r\ 1/r 



PeT:|7|>C"|J|. 
Cfc-i(a;)ew;,^fc{a;)Gi^(i 



< ll(eT^fc) * V(/,0p)0p(x)| 



pgt 



y,r(z++iog2(|j|)) 
where the notation refers to the variation norm with respect to the variable $k$, restricted
to {k £Z: \k\ > logad J|)}. For logsd J|) < < we have 



Per 



P€T 



and so, for x G J 



l(eT^fc) * ^{fAp)(t>p{^ 



P&T 



ly;{Z++log2{|J|)) 



< sup sup 



x&J R>\J\ \R\ Jx-R 



x+R 



l(eT^fe) * ^{f,cl)p)(l)p{y)\\yr 



P£T 



;'-(z++iog2(|j|)) 



dy. 



We now integrate the $q$th power of the expressions in (50) over $E\cap J$ and obtain

q 



{fAp)<PpaplE 

PGT:|/|>C"|J| 

■X+R 



< 

LHJ) JjnE 



^ \ak{t)\^'dtY'''x 

fc:5fe_i(i)GCj 



2 rx+JX 

sup sup — / Wieri'k) * (/> 'Ap)'^p(y)||v7-(2++iog^(|j,)) dy. 



< 



dx 



M[\\^k*{eT'Y.^f^<t>p)4>p)\\v^]{x) 
PeT 

where $\mathcal M$ is the Hardy-Littlewood maximal operator. For the last inequality we have
used $q\ge r'$, $|a_k(x)|\le1$ and inequality (49). Summing over $J\in\mathbf J$ gives



{f,(l)p)(l)paplE 

P£T:\I\>C'"\J\ 



Li{ieiT) 



\v,: [X) 



PeT 



IL|(16/t)' 



Since $q\le2$, it follows from Hölder's inequality that the right side above is



pgt 



L2(16/t)- 



Applying the variation estimate (41) with $p=2$ and the $L^2$ estimate for $\mathcal M$ one sees
that the display above is



< 



\It\ 



(2-q)/2\ 



PeT 



')P (pp 



1 

L2' 



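To finish the proof, it only remains to see that $\|\sum_{P\in T}\langle f,\phi_P\rangle\phi_P\|_{L^2}^2\lesssim\mathcal E^2|I_T|$. The
left side of this inequality is dominated by

$\sum_{P\in T}\sum_{P'\in T}|\langle f,\phi_P\rangle\langle f,\phi_{P'}\rangle\langle\phi_P,\phi_{P'}\rangle|\le2\sum_{P\in T}|\langle f,\phi_P\rangle|^2\sum_{P'\in T}|\langle\phi_P,\phi_{P'}\rangle|.$

Since $T$ is an $l$-overlapping tree, we have $\langle\phi_P,\phi_{P'}\rangle=0$ unless $|I|=|I'|$, in which case, we
have $|\langle\phi_P,\phi_{P'}\rangle|\lesssim(1+{\rm dist}(I,I')/|I|)^{-N}$. Therefore we obtain the estimate

$\Big\|\sum_{P\in T}\langle f,\phi_P\rangle\phi_P\Big\|_{L^2}^2\lesssim\sum_{P\in T}|\langle f,\phi_P\rangle|^2\le\mathcal E^2|I_T|.$

This concludes the proof of (48) and thus the proof of the proposition. □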
6. Two auxiliary estimates



Before we give the argument on how to decompose our operators into trees with
suitable energy and density bounds we need two auxiliary estimates.

The following proposition can be found in [33], p. 12, or as a special case of a lemma 
from 



Proposition 6.1. Let $T$ be an $l$-overlapping tree. Let $\lambda>0$ and $\Omega_{\lambda,D}=\{\mathcal M_D[1_F]>\lambda\}$
where $\mathcal M_D$ is the maximal dyadic average operator. Then

' ' P(^T:I(tnx,D 

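The second auxiliary estimate is the special case of an estimate from [12], but we will
provide a proof for convenience.

Proposition 6.2. Let $\mathbf P$ be a finite set of multitiles, and let $\lambda>0$, $F\subset\mathbb R$, and $|f|\le1_F$.
Let $\Omega_\lambda=\{\mathcal M[1_F]>\lambda\}$. Then

(54) $\Big\|\sum_{P\in\mathbf P:\,I\subset\Omega_\lambda}\langle f,\phi_P\rangle\phi_Pa_P\Big\|_{L^1(\mathbb R\setminus\Omega_\lambda)}\lesssim\frac{|F|}{\lambda^{1/r}}.$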



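Proof. Fix $\ell\ge0$ and let $I_\ell\subset\Omega_\lambda$ be a dyadic interval satisfying
(55) $2^{\ell}I_\ell\subset\Omega_\lambda$ and $2^{\ell+1}I_\ell\not\subset\Omega_\lambda$.

By Minkowski's inequality, we estimate

$\Big(\sum_{P:\,I=I_\ell}|\langle f,\phi_P\rangle|^2\Big)^{1/2}\le\Big(\sum_{P:\,I=I_\ell}|\langle1_{4I_\ell}f,\phi_P\rangle|^2\Big)^{1/2}+\sum_{j=2}^{\infty}\Big(\sum_{P:\,I=I_\ell}|\langle1_{2^{j+1}I_\ell\setminus2^{j}I_\ell}f,\phi_P\rangle|^2\Big)^{1/2}.$

Using orthogonality, the right hand side is bounded by

$(4|I_\ell|)^{1/2}\|1_{4I_\ell}f\phi_{P_0}\|_{L^2}+\sum_{j=2}^{\infty}(2^{j+1}|I_\ell|)^{1/2}\|1_{2^{j+1}I_\ell\setminus2^{j}I_\ell}f\phi_{P_0}\|_{L^2}$

where $P_0$ is any multitile with $I_0=I_\ell$. Applying the bounds (14) and $|f|\le1_F$, we see
that the last display is

$\lesssim|F\cap4I_\ell|^{1/2}+\sum_{j=2}^{\infty}C2^{-j(N-1)}|F\cap2^{j+1}I_\ell|^{1/2}.$

Since $2^{\ell+1}I_\ell\not\subset\Omega_\lambda$, we have $|F\cap2^{j+1}I_\ell|\le C2^{\max(\ell,j)}|I_\ell|\lambda$ for each $j$. Thus, the last
display is $\lesssim(2^{\ell}\lambda|I_\ell|)^{1/2}$ and we have proved

$\Big(\sum_{P:\,I=I_\ell}|\langle f,\phi_P\rangle|^2\Big)^{1/2}\lesssim(2^{\ell}\lambda|I_\ell|)^{1/2}.$

Similarly,

$\sup_{P:\,I=I_\ell}|\langle f,\phi_P\rangle|\lesssim2^{\ell}\lambda|I_\ell|^{1/2}$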
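and so, by interpolation,

(56) $\Big(\sum_{P:\,I=I_\ell}|\langle f,\phi_P\rangle|^r\Big)^{1/r}\lesssim(2^{\ell}\lambda)^{1/r'}|I_\ell|^{1/2}$

whenever $2\le r\le\infty$. For each $\xi$, $I_\ell$ there is at most one $P\in\mathbf P$ with $\xi\in\omega_l$ and $I=I_\ell$.
Thus, using the fact that, for each $x$, $\sum_k|a_k(x)|^{r'}\le1$, we see that

$\Big\|\sum_{P\in\mathbf P:\,I=I_\ell}\langle f,\phi_P\rangle\phi_Pa_P\Big\|_{L^1(\mathbb R\setminus\Omega_\lambda)}\lesssim(2^{\ell}\lambda)^{1/r'}|I_\ell|^{1/2}\|\phi_{P_0}\|_{L^1(\mathbb R\setminus\Omega_\lambda)}$

where $P_0$ is any multitile with $I_0=I_\ell$. Using the fact that $2^{\ell}I_\ell\subset\Omega_\lambda$, it follows that the
right side above is

$\lesssim2^{-\ell(N-2)}\lambda^{1/r'}|I_\ell|.$

For $\ell\ge0$ let $\mathcal I_\ell$ be the set of all dyadic intervals satisfying (55). If $I\in\mathcal I_\ell$ then for each
$j\ge0$ there are at most 2 intervals $I'\in\mathcal I_\ell$ with $I'\subset I$ and $|I'|=2^{-j}|I|$. By considering
the collection of maximal dyadic intervals in $\mathcal I_\ell$, one sees that

$\sum_{I\in\mathcal I_\ell}|I|\lesssim|\Omega_\lambda|.$

Thus,

$\Big\|\sum_{P\in\mathbf P:\,I\in\mathcal I_\ell}\langle f,\phi_P\rangle\phi_Pa_P\Big\|_{L^1(\mathbb R\setminus\Omega_\lambda)}\lesssim2^{-\ell(N-2)}\lambda^{1/r'}|\Omega_\lambda|.$

Summing over $\ell$ and applying the weak-type 1-1 estimate for $\mathcal M$ then gives (54). □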



7. Conclusion of the proof 

Let $r>2$ and $r'<p<\infty$. We shall conclude the proof of (13), with $1\le|E|\le2$,
and thus of Theorem 1.2. It will then suffice, by Chebyshev's inequality, to show

(57) $\int_{E\setminus G}|S_{\mathbf P}[f](x)|\,dx\le C|F|^{1/p}$

for any measurable set $E$ with $1\le|E|\le2$, $|f|\le1_F$ and some exceptional set $G=G(E,F)$
with $|G|\le1/4$.

We shall repeatedly apply Propositions 4.3 and 4.4. By Lemma 4.2 the density of $\mathbf P$
(with respect to the set $E$ above) and the energy (with respect to $f$) are bounded by a
universal constant $C$.

We distinguish between the cases $|F|>1$ and $|F|\le1$ and first consider the case when
$|F|>1$. Repeatedly applying Propositions 4.3 and 4.4 we write $\mathbf P$ as the disjoint union

$\mathbf P=\bigcup_{j\ge0}\bigcup_{T\in\mathbf T_j}T$

where each $\mathbf T_j$ is a collection of trees $T$ each of which have energy bounded by $C2^{-j/2}|F|^{1/2}$,
density bounded by $C2^{-j/r'}$, and satisfy

$\sum_{T\in\mathbf T_j}|I_T|\lesssim2^{j}.$

For each $j$ we apply Proposition 4.3 again, this time using (18) and (20) to write

$\bigcup_{T\in\mathbf T_j}T=\bigcup_{k\ge0}\bigcup_{T\in\mathbf T_{j,k}}T$

where each tree $T\in\mathbf T_{j,k}$ has energy bounded by $C2^{-(j+k)/2}|F|^{1/2}$, density bounded by
$C2^{-j/r'}$, and satisfies

(58) $\sum_{T\in\mathbf T_{j,k}}|I_T|\lesssim2^{j}.$

Moreover, for every $\ell\ge0$

(59) $\Big\|\sum_{T\in\mathbf T_{j,k}}1_{2^{\ell}I_T}\Big\|_{BMO}\lesssim2^{2\ell}2^{j+k}|F|^{-1}.$

Inequality (58) implies $\|\sum_{T\in\mathbf T_{j,k}}1_{2^{\ell}I_T}\|_{L^1}\lesssim2^{\ell+j}$, and we may interpolate the $L^1$ and the
BMO bound. Here we use a standard technique involving the sharp maximal function
from §5 in [11]. It follows that for $1\le q<\infty$



Let $\epsilon>0$ be small and $C''>0$ be large, depending on $p$, $q$, $r$. For each $j,k,\ell$ define

By Chebyshev's inequality, we have

so setting $G=\bigcup_{j,k,\ell\ge0}G_{j,k,\ell}$ we have $|G|\le1/4$.
Applying Minkowski's inequality gives



Is E (f'p)<PP"'p\ 
PeP 



+ EII^^ E l2^M2^-l/T E^^''^^)'^^«^ILl(K\G,,.,)7• 
From Hölder's inequality, Fubini's theorem, and the definition of $G_{j,k,\ell}$, it follows that
the right side above is $\lesssim(S_1+S_2)$ where



5i= E E \\^EY.(f^cPpHpap\C,^ 

j,k>0 TGTj.fe PeT 



l/r' 



and 

5,= 5^|i.|-V(.V)2(i+.)0>.+2.)A.( E l|li^E(/'^^)^^«HIL'^'(M\2-.z. ''''' 
j,k>o tgTj fe PeT 



28 R. OBERLIN, A. SEEGER, T. TAO, C. THIELE, J. WRIGHT

Applying Proposition 5.1 with the energy and density bounds for trees $T\in\mathbf T_{j,k}$, we
see that

52 < ^ |_p|-l/{9''^)2(l+^)(j+^+2^)A'-2-^(^-10)2-(j+fe)/2|_p|l/22-j/r'J^ ^ |/t|^^^''' 



i>i 

Choosing e small enough and q large enough so that 

2 111 

(1 + e)- - 1 < and < - 

r 2 g'r p 

we have $S_2\lesssim|F|^{1/p}$. We similarly obtain $S_1\lesssim|F|^{1/p}$, thus giving (57).

We will finish by proving (57) for $|F|\le1$. Here, we let

$G=\{x:\mathcal M[1_F](x)>C''|F|\}$

where $\mathcal M$ is the Hardy-Littlewood maximal operator and $C''$ is chosen large enough so
that the weak-type 1-1 estimate for $\mathcal M$ guarantees $|G|\le1/4$.

By Proposition 6.2 and the fact that $p>r'$, it will remain to show that

(60) $\Big\|1_E\sum_{P\in\mathbf P'}\langle f,\phi_P\rangle\phi_Pa_P\Big\|_{L^1(\mathbb R\setminus G)}\lesssim|F|^{1/p}$

where $\mathbf P'=\{P\in\mathbf P:I\not\subset G\}$.

Finally, it follows from Proposition 6.1 that the energy of $\mathbf P'$ is bounded above by
$C|F|$. Repeatedly applying Propositions 4.3 and 4.4 we write $\mathbf P'$ as the disjoint union

$\mathbf P'=\bigcup_{j\ge0}\bigcup_{T\in\mathbf T_j}T$

where each $\mathbf T_j$ is a collection of trees $T$ each of which have energy bounded by $C_02^{-j/2}|F|^{1/2}$,
density bounded by $C_02^{-j/r'}$, and satisfy

$\sum_{T\in\mathbf T_j}|I_T|\lesssim2^{j}.$

We then have

$\Big\|1_E\sum_{P\in\mathbf P'}\langle f,\phi_P\rangle\phi_Pa_P\Big\|_{L^1}\le\sum_{j\ge0}\sum_{T\in\mathbf T_j}\Big\|1_E\sum_{P\in T}\langle f,\phi_P\rangle\phi_Pa_P\Big\|_{L^1}.$

Applying Proposition 5.1 we see that the right side above is

$\lesssim\sum_{j\ge0}\sum_{T\in\mathbf T_j}\min(2^{-j/2}|F|^{1/2},|F|)\,2^{-j/r'}|I_T|\lesssim\sum_{j\ge0}2^{j/r}\min(2^{-j/2}|F|^{1/2},|F|).$

Summing over $j$, we see that the right side above is $\lesssim|F|^{1/r'}$. This finishes the proof,

since $p>r'$.
A. Transference



In this section we show how to obtain Theorem 1.1 from Theorem 1.2. We employ
arguments from Chapter VII in the monograph by Stein and Weiss [31] in their proof of
De Leeuw's transference result ([U]). The following limiting relation is used:

Lemma A.1 ([31], p. 261). Let $m\in L^\infty(\mathbb R^d)$ with the property that every $k\in\mathbb Z^d$ is a

Lebesgue point of $m$. Define a convolution operator $T$ on $L^2(\mathbb R^d)$ by the Fourier transform
identity $\widehat{Tf}(\xi)=m(\xi)\hat f(\xi)$ and a convolution operator $\tilde T$ on $L^2(\mathbb T^d)$ by the relation $[\tilde Tf]^\wedge_k=m(k)\hat f_k$
for the Fourier coefficients. Let, for $x\in\mathbb R^d$,

vj{x) = e~^^'^ and vjr{x) = R~'^w{x/R). 

Then, for all trigonometric polynomials P and Q (extended as 1-periodic functions in 
every variable) we have 

(61) / f[P]{x)Q{x)dx=lim / T[PwR/ ,]{x)Q{x)wjij {x)dx. 



We also need the following elementary fact on Lorentz spaces.

Lemma A. 2. Let f € LP''?(T'^) and extend f to a function /p"' on M"^ which is 1- 
periodic in every variable. Let L > d/p and let w be a measurable function satisfying 
\w{x)\ < (1 + \x\)~^. Let wr{x) = R~'^w{R~^x). Then 

sup Il/P'^'-Wijll^p.^^igd) < Cp,g||/||ip.,(Trd) 



Proof We first assume p = q. Let Qo := [-^ ^]'^- IfA^GNandiV<i2<iV + l then 
wr{x) ~ N~'^{1 + \n\/N)~^P for x G n + Qq and by the periodicity we can estimate 
||/p-u;k||^ by C E„ez^ N-^{1 + \n\/Nr^P\\f\\l,^Q^^ which is < since Lp > d. 

For fixed L we apply real interpolation in the range p < L/d and obtain the Lorentz 
space result. □ 



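Proof that Theorem 1.2 implies Theorem 1.1. We shall assume that $1<p<\infty$, $1\le q_1<\infty$,
and $q_1\le q_2\le\infty$ and prove that the $L^{p,q_1}(\mathbb R)\to L^{p,q_2}(\mathbb R;V^r)$ bound for the partial
sum operator $S$ on the real line implies the corresponding result on the torus, i.e.

(62) $\Big\|\sup_K\ \sup_{0\le n_1<\dots<n_K}\Big(\sum_{i=1}^{K-1}|S_{n_{i+1}}f-S_{n_i}f|^r\Big)^{1/r}\Big\|_{L^{p,q_2}(\mathbb T)}\le C\|f\|_{L^{p,q_1}(\mathbb T)}.$

By two applications of the monotone convergence theorem it suffices to show for fixed
$K\in\mathbb N$ with $K\ge2$, and fixed $M\in\mathbb N$ that

(63) $\Big\|\sup_{0\le n_1<\dots<n_K\le M}\Big(\sum_{i=1}^{K-1}|S_{n_{i+1}}f-S_{n_i}f|^r\Big)^{1/r}\Big\|_{L^{p,q_2}(\mathbb T)}\le C\|f\|_{L^{p,q_1}(\mathbb T)}$

where $C$ does not depend on $M$ and $K$.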



For $n=(n_1,\dots,n_K)\in\mathbb N_0^K$, $1\le i\le K-1$ define $Tf(x,n,i)=S_{n_{i+1}}f(x)-S_{n_i}f(x)$ if
$n_1<\dots<n_K$ and $Tf(x,n,i)=0$ otherwise. Then the inequality (63) just says that $T$
is bounded from $L^{p,q_1}$ to $L^{p,q_2}(\ell^\infty(\ell^r))$ where the $\ell^\infty$ norm is taken for functions on the
finite set $\{1,\dots,M\}^K$ and the $\ell^r$ norm is for functions on $\{1,\dots,K-1\}$. By duality
(63) follows from the $L^{p',q_2'}(\ell^1(\ell^{r'}))\to L^{p',q_1'}$ inequality for the adjoint operator $T^*$, i.e.
from the inequality



1 A'-l 
''^ 0<ni<---<n/f <M i=l 



K-l 



^ ^ ^ f ^ ^ \fn;i\ 



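We fix an irrational number $\lambda$ in $(0,1)$, say $\lambda=1/\sqrt2$. We then define "partial sum
operators" for Fourier integrals by $\widehat{\mathfrak S_tf}(\xi)=\chi_{[-\lambda,\lambda]}(\xi/t)\hat f(\xi)$ and a corresponding partial
sum operator $\tilde{\mathfrak S}_t$ on Fourier series by letting the $k$th Fourier coefficient of $\tilde{\mathfrak S}_tf$ be equal
to $\chi_{[-\lambda,\lambda]}(k/t)\hat f_k$. We define a function $\nu:\mathbb N_0\to\mathbb N_0$ as follows: set $\nu(0)=0$ and for
$n>0$ let $\nu(n)$ be the smallest positive integer $\nu$ for which $\lambda\nu>n$. Notice that then

(65) $S_{n_{i+1}}f_{n,i}(x)-S_{n_i}f_{n,i}(x)=\tilde{\mathfrak S}_{\nu(n_{i+1})}f_{n,i}-\tilde{\mathfrak S}_{\nu(n_i)}f_{n,i}.$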
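The identity (65) rests on the fact that, for $\lambda=1/\sqrt2$ and $n\ge1$, the integers $k$ with $|k|\le\lambda\nu(n)$ are exactly those with $|k|\le n$. The following Python sketch checks this in exact integer arithmetic. The function names are ours, and the convention $\nu(0)=0$ is taken as an assumption from the text above.

```python
from math import isqrt


def nu(n):
    """Smallest positive integer v with v / sqrt(2) > n, i.e. v*v > 2*n*n.
    By convention (assumed here) nu(0) = 0."""
    if n == 0:
        return 0
    # floor(n * sqrt(2)) computed exactly; the next integer exceeds n * sqrt(2)
    return isqrt(2 * n * n) + 1


def largest_frequency(t):
    """Largest integer k with k <= t / sqrt(2), i.e. with 2*k*k <= t*t."""
    return isqrt(t * t // 2)


if __name__ == "__main__":
    # The multiplier chi_{[-lambda, lambda]}(k / nu(n)) keeps exactly |k| <= n.
    print(all(largest_frequency(nu(n)) == n for n in range(1, 10_000)))
```

Working with squares avoids floating-point rounding near the irrational threshold $n\sqrt2$.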



Now in order to prove (64) it clearly suffices to verify it for the case that the functions
$f_{n,i}$ and $Q$ are trigonometric polynomials. The multipliers corresponding to $\mathfrak S_t$ are
continuous at every integer. Thus by (61) (applied with $d=1$) and (65) we see that (64)
is implied by



(66) 



K-l „i 



E J2 [ ['^u{ni+i){fn,iWR,p')-&^(^ni){fn,iWR,p')]QwRipdx 

0<ni<---<nK<M i=l ^ 



K-l 



n 1=1 



for sufficiently large R. 



Now notice that $\mathfrak S_t=S_{\lambda t}-S_{-\lambda t}$ so that the assumed $L^{p,q_1}(\mathbb R)\to L^{p,q_2}(V^r,\mathbb R)$ boundedness
for the family $\{S_t\}$ implies the analogous statement for the family $\{\mathfrak S_t\}$. We
run the duality argument in the reverse direction (now for functions defined on $\mathbb R$) and
deduce



» K~l 

L E e(s [fn,^ 

•^^0<ni<-<nK<M i=l 



Wr/p'] - Gi,{n,)\fn,iWR/p>\)QwR/pdx 



K-l 



< 



E Ei/n^> 



Wr/p' 



n i=l 



By Lemma A.2 the right hand side of this inequality is for $R\ge\max\{p,p'\}$ bounded by
the right hand side of (66). Thus we have established inequality (66) and this concludes
the proof. □
B. A variational Menshov-Paley-Zygmund theorem
For $\xi,x\in\mathbb R$ let

$\mathcal C[f](\xi,x)=\int_{-\infty}^{x}e^{-2\pi i\xi x'}f(x')\,dx'.$

Menshov, Paley, and Zygmund extended the Hausdorff-Young inequality by proving a
version of the bound

(67) $\|\mathcal C[f]\|_{L^{p'}_\xi(L^\infty_x)}\le C_p\|f\|_{L^p(\mathbb R)}$

for $1\le p<2$. The bound at $p=2$ is a special case of the much more difficult maximal
inequality for the partial sum operator proved by Carleson and Hunt. Interpolating
the variational version, Theorem 1.2, at $p=2$ with a trivial estimate at $p=1$, one sees
that (67) may be strengthened to the bound

(68) $\|\mathcal C[f]\|_{L^{p'}_\xi(V^r_x)}\le C_{p,r}\|f\|_{L^p}$

for $1\le p\le2$ and $r>p$. It follows from the same arguments given in Section 2 that this
range of $r$ is the best possible. Our interest in this variational bound primarily stems from
the fact, which will be proven in Appendix C, that it may be transferred, when $r<2$, to
give a corresponding estimate for certain nonlinear Fourier summation operators. The
purpose of the present appendix is to give an easier alternate proof of (68) when $p<2$.
Note that Pisier and Xu [27] have proved closely related U' — t- L'^{V^) inequalities for 
orthonormal systems of (not necessarily bounded) functions on an arbitrary measure 
space. 

A now-famous lemma of Christ and Kiselev [5] asserts that if an integral operator

$Tf(x)=\int_{\mathbb R}K(x,y)f(y)\,dy$

is bounded from $L^p(\mathbb R)$ to $L^q(X)$ for some measure space $X$ and some $q>p$, thus

$\|Tf\|_{L^q(X)}\le A\|f\|_{L^p(\mathbb R)},$

then automatically the maximal function

$T_*f(x)=\sup_{N\in\mathbb R}\Big|\int_{y<N}K(x,y)f(y)\,dy\Big|$

is also bounded from $L^p(\mathbb R)$ to $L^q(X)$, with a slightly larger constant. Another way to
phrase this is as follows. If we define the partial integrals

$T_{\le}f(x,N)=\int_{y<N}K(x,y)f(y)\,dy$

then we have

(69) $\|T_{\le}f\|_{L^q_x(L^\infty_N)}\le C_{p,q}A\|f\|_{L^p(\mathbb R)}.$

As was observed by Christ and Kiselev, this may be applied in conjunction with the
Hausdorff-Young inequality to obtain (67) for $p<2$.

The $L^\infty_N$ norm can also be interpreted as the $V^\infty_N$ norm, and we will now see that $V^\infty_N$
can be replaced by $V^r_N$ for $r>p$, thus giving (68) from the Hausdorff-Young inequality.
Lemma B.1. Under the same assumptions, we have

$\|T_{\le}f\|_{L^q_x(V^r_N)}\le C_{p,q,r}A\|f\|_{L^p(\mathbb R)}$

for any $r>p$.

Proof. This follows by an adaptation of the argument by Christ and Kiselev, or by the
following argument. Without loss of generality we may take r < q, in particular r < ∞.
We use a bootstrap argument. Let us make the a priori assumption that

(70)    $\|T_{<} f\|_{L^q_x(V^r_N)} \le B A\,\|f\|_{L^p(\mathbb{R})}$

for some constant 0 < B < ∞; this can be accomplished for instance by truncating the
kernel K appropriately. We will show that this a priori bound automatically implies the
bound

(71)    $\|T_{<} f\|_{L^q_x(V^r_N)} \le (2^{1/r-1/p} B A + C_{p,q,r} A)\,\|f\|_{L^p(\mathbb{R})}$

for some $C_{p,q,r} > 0$. This implies that the best bound B in the above inequality will
necessarily obey the inequality

    $B \le 2^{1/r-1/p} B + C_{p,q,r};$

since r > p, this implies $B \le C'_{p,q,r}$ for some finite $C'_{p,q,r}$, and the claim follows.
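
The closing step of the bootstrap can be spelled out as a short worked computation. This is only a sketch: the abbreviations $\theta$ and $C$ are introduced here for illustration and do not appear in the original argument.

```latex
% Notation (ours, for illustration): \theta := 2^{1/r - 1/p}, \quad C := C_{p,q,r}.
% Since r > p we have 1/r - 1/p < 0, hence 0 < \theta < 1.
\begin{align*}
B \le \theta B + C
  &\iff (1-\theta)\,B \le C \\
  &\iff B \le \frac{C}{1-\theta} = \frac{C_{p,q,r}}{1 - 2^{1/r-1/p}} .
\end{align*}
% Subtracting \theta B from both sides is legitimate only because B < \infty,
% which is what the a priori assumption (70) provides.
```

The resulting bound degenerates as r approaches p, which is consistent with the restriction r > p in the lemma.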

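It remains to deduce (71) from (70). Fix f; we may normalize $\|f\|_{L^p(\mathbb{R})} = 1$. We find
a partition point $N_0$ in the real line which halves the $L^p$ norm of f:

    $\int_{-\infty}^{N_0} |f(y)|^p\,dy = \int_{N_0}^{+\infty} |f(y)|^p\,dy = \frac{1}{2}.$

Write $f_-(y) = f(y) 1_{(-\infty,N_0]}(y)$ and $f_+(y) = f(y) 1_{[N_0,+\infty)}(y)$, thus $\|f_-\|_{L^p(\mathbb{R})} = 2^{-1/p}$
and $\|f_+\|_{L^p(\mathbb{R})} = 2^{-1/p}$. We observe that

    $T_{<} f(x,N) = \begin{cases} T_{<} f_-(x,N) & \text{when } N \le N_0, \\ T f_-(x) + T_{<} f_+(x,N) & \text{when } N > N_0. \end{cases}$

Furthermore, $T_{<} f_-(x,\cdot)$ and $T_{<} f_+(x,\cdot)$ are bounded in $L^\infty$ norm by $O(T_* f(x))$. Thus
we have

    $\|T_{<} f(x,\cdot)\|_{V^r} \le \big( \|T_{<} f_-(x,\cdot)\|_{V^r}^r + \|T_{<} f_+(x,\cdot)\|_{V^r}^r \big)^{1/r} + O(T_* f(x)).$

(The $O(T_* f(x))$ error comes because the partition used to define $\|T_{<} f(x,\cdot)\|_{V^r}$ may have
one interval which straddles $N_0$.) We take $L^q$ norms of both sides to obtain

    $\|T_{<} f\|_{L^q_x(V^r_N)} \le \big\| \big( \|T_{<} f_-(x,\cdot)\|_{V^r}^r + \|T_{<} f_+(x,\cdot)\|_{V^r}^r \big)^{1/r} \big\|_{L^q_x} + O(\|T_* f\|_{L^q}).$

The error term is at most $C_{p,q} A$ by the ordinary Christ-Kiselev lemma. For the main
term, we take advantage of the fact that r < q to interchange the $\ell^r$ and $L^q$ norms, thus
obtaining

    $\|T_{<} f\|_{L^q_x(V^r_N)} \le \big( \|T_{<} f_-\|_{L^q_x(V^r_N)}^r + \|T_{<} f_+\|_{L^q_x(V^r_N)}^r \big)^{1/r} + O(C_{p,q} A).$

Applying the a priori assumption (70) to $f_-$ and $f_+$, we thus have

    $\|T_{<} f\|_{L^q_x(V^r_N)} \le \big( (2^{-1/p} B A)^r + (2^{-1/p} B A)^r \big)^{1/r} + O(C_{p,q} A),$

and the claim follows.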



□ 
C. Variation norms on Lie groups

In this appendix, we will show that certain r-variation norms for curves on Lie groups
can be controlled by the corresponding variation norms of their "traces" on the Lie
algebra as long as r < 2. This follows from work of Terry Lyons [23]; we present a
self-contained proof in this appendix. Combining this fact with the variational
Menshov-Paley-Zygmund theorem of Appendix B, we rederive the Christ-Kiselev theorem on the
pointwise convergence of the nonlinear Fourier summation operator for $L^p(\mathbb{R})$ functions,
1 < p < 2.

Let G be a connected finite-dimensional Lie group with Lie algebra $\mathfrak{g}$. We give $\mathfrak{g}$ any
norm $\|\cdot\|_{\mathfrak{g}}$, and push forward this norm using left multiplication by the Lie group to
define a norm $\|x\|_{T_gG} = \|g^{-1}x\|_{\mathfrak{g}}$ on each tangent space $T_gG$ of the group. Observe that
this norm structure is preserved under left group multiplication.

We can now define the length $|\gamma|$ of a continuously differentiable path $\gamma : [a,b] \to G$
by the usual formula

    $|\gamma| = \int_a^b \|\gamma'(t)\|_{T_{\gamma(t)}G}\,dt.$

Observe that this notion of length is invariant under left group multiplication, and also
under reparameterization of the path $\gamma$.

From this notion of length, we can define a metric $d(g,g')$ on G as

    $d(g,g') = \inf_{\gamma:\ \gamma(a)=g,\ \gamma(b)=g'} |\gamma|$

where $\gamma$ ranges over all differentiable paths from g to g'. It is easy to see that this does
indeed give a metric on G.

Given any continuous path $\gamma : [a,b] \to G$ and $1 \le r < \infty$, we define the r-variation
$\|\gamma\|_{V^r}$ of $\gamma$ to be the quantity

    $\|\gamma\|_{V^r} = \sup_{a=t_0<t_1<\dots<t_n=b} \Big( \sum_{j=0}^{n-1} d(\gamma(t_{j+1}),\gamma(t_j))^r \Big)^{1/r}$

where the supremum ranges over all partitions of [a,b] by finitely many times $a = t_0, t_1,
\dots, t_n = b$. We can extend this to the r = ∞ case in the usual manner as

    $\|\gamma\|_{V^\infty} = \sup_{a=t_0<t_1<\dots<t_n=b}\ \sup_{0\le j\le n-1} d(\gamma(t_{j+1}),\gamma(t_j)),$

and indeed it is clear that the $V^\infty$ norm of $\gamma$ is simply the diameter of the range of $\gamma$.
The $V^1$ norm of $\gamma$ is finite precisely when $\gamma$ is rectifiable, and when $\gamma$ is differentiable
it corresponds exactly with the length $|\gamma|$ of $\gamma$ defined earlier. It is easy to see the
monotonicity property

    $\|\gamma\|_{V^p} \le \|\gamma\|_{V^r}$ whenever $1 \le r \le p \le \infty$

and the triangle inequalities

    $\big( \|\gamma_1\|_{V^r}^r + \|\gamma_2\|_{V^r}^r \big)^{1/r} \le \|\gamma_1+\gamma_2\|_{V^r} \le \|\gamma_1\|_{V^r} + \|\gamma_2\|_{V^r}$

where $\gamma_1 + \gamma_2$ is the concatenation of $\gamma_1$ and $\gamma_2$. A key fact about the $V^r$ norms is that
they can be subdivided:

Lemma C.1. Let $\gamma : [a,b] \to G$ be a continuously differentiable curve with finite $V^r$
norm. Then there exists a decomposition $\gamma = \gamma_1 + \gamma_2$ of the curve into two sub-curves
such that

    $\|\gamma_1\|_{V^r},\ \|\gamma_2\|_{V^r} \le 2^{-1/r}\,\|\gamma\|_{V^r}.$

Proof. Let $t_* = \sup\{t \in [a,b] : \|\gamma|_{[a,t]}\|_{V^r} \le 2^{-1/r}\|\gamma\|_{V^r}\}$. Letting $\gamma_1 = \gamma|_{[a,t_*]}$ we have
$\|\gamma_1\|_{V^r} = 2^{-1/r}\|\gamma\|_{V^r}$. The bound for $\gamma_2 = \gamma|_{[t_*,b]}$ follows from the left triangle inequality
above. □



Given a continuously differentiable curve $\gamma : [a,b] \to G$, we can define its left trace
$\gamma_l : [a,b] \to \mathfrak{g}$ by the formula

    $\gamma_l(t) = \int_a^t \gamma(s)^{-1}\gamma'(s)\,ds.$

Note that the trace is also a continuously differentiable curve, but taking values now in
the Lie algebra $\mathfrak{g}$ instead of G. Clearly $\gamma_l$ is determined uniquely from $\gamma$. The converse
is also true after specifying the initial point $\gamma(a)$ of $\gamma$, since $\gamma$ can then be recovered by
solving the ordinary differential equation

(72)    $\gamma'(t) = \gamma(t)\,\gamma_l'(t).$

This equation is fundamental in the theory of eigenfunctions of a one-dimensional
Schrödinger or Dirac operator, or equivalently in the study of the nonlinear Fourier transform;
see, for example, [32], [25] for a full discussion. Basically for a fixed potential f(t) and
a frequency k, the nonlinear Fourier transform traces out a curve $\gamma(t)$ (depending on k)
taking values in a Lie group (e.g. SU(1,1)), and the corresponding left trace is essentially
the ordinary linear Fourier transform.

It is easy to see that these curves have the same length (i.e. they have the same $V^1$
norm):

(73)    $|\gamma| = |\gamma_l|.$

We now show that something similar is true for the $V^r$ norms provided that r < 2.

Lemma C.2. Let $1 \le r < 2$, let G be a connected finite-dimensional Lie group, and let
$\|\cdot\|_{\mathfrak{g}}$ be a norm on the Lie algebra of G. Then there exists a constant C > 0 depending
only on these above quantities, such that for all smooth curves $\gamma : [a,b] \to G$, we have

(74)    $\|\gamma\|_{V^r} \le \|\gamma_l\|_{V^r} + C \min(\|\gamma_l\|_{V^r}^2, \|\gamma_l\|_{V^r}^r)$

and

(75)    $\|\gamma_l\|_{V^r} \le \|\gamma\|_{V^r} + C \min(\|\gamma\|_{V^r}^2, \|\gamma\|_{V^r}^r).$

An analogous result holds for the right trace, $\int_a^t \gamma'(s)\gamma(s)^{-1}\,ds$, once the left-invariant
norm on $T_gG$ is replaced by a right-invariant norm.

Proof. We may take r > 1 since the claim is already known for r = 1 thanks to (73).

It shall suffice to prove the existence of a small $\delta > 0$ such that we have the estimate

(76)    $\|\gamma\|_{V^r} = \|\gamma_l\|_{V^r} + O(\|\gamma_l\|_{V^r}^2)$

whenever $\|\gamma_l\|_{V^r} \le \delta$, and similarly

(77)    $\|\gamma_l\|_{V^r} = \|\gamma\|_{V^r} + O(\|\gamma\|_{V^r}^2)$

whenever $\|\gamma\|_{V^r} \le \delta$. (We allow the O() constants here to depend on r, the Lie group
G, and the norm structure, but not on $\delta$.) Let us now see why these estimates will prove
the lemma. Let us begin by showing that (76) implies (74). Certainly this will be the
case if $\gamma_l$ has $V^r$ norm less than $\delta$. If instead $\gamma_l$ has $V^r$ norm larger than $\delta$, we can use
Lemma C.1 repeatedly to partition it into $O(\delta^{-r}\|\gamma_l\|_{V^r}^r)$ curves, all of whose $V^r$ norms
are less than $\delta$. These curves are the left-traces of various components of $\gamma$, and thus
by (76) these components have a $V^r$ norm bounded by some quantity depending on $\delta$.
Concatenating these components together (using the triangle inequality) we obtain the
result. A similar argument allows one to deduce (75) from (77).

Next, we observe that to prove the two estimates (76), (77) it suffices to just prove
one of the two, for instance (76), as this will also imply (77) for $\|\gamma\|_{V^r}$ sufficiently small
by the usual continuity argument (look at the set of times t for which the restriction of $\gamma$
to [a,t] obeys a suitable version of (77), and use (76) to show that this set is both open
and closed if $\|\gamma\|_{V^r}$ is small enough).

It remains to prove (76) for $\delta$ sufficiently small. We shall in fact prove the more precise
statement

(78)    $\|\log(\gamma(a)^{-1}\gamma(b)) - \gamma_l(b)\|_{\mathfrak{g}} \le K\,\|\gamma_l\|_{V^r}^2$

for some absolute constant K > 0 (and for $\delta$ sufficiently small), where log is the inverse
of the exponential map $\exp : \mathfrak{g} \to G$. Note that it follows from a continuity argument
as in the previous paragraph that if $\delta$ is sufficiently small then $\gamma(b)^{-1}\gamma(a)$ is sufficiently
close to the identity that the logarithm is well-defined. Let us now see why (78) implies
(76). Applying the inequality to any segment $[t_j,t_{j+1}]$ in [a,b] we see that

    $\|\log(\gamma(t_j)^{-1}\gamma(t_{j+1})) - (\gamma_l(t_{j+1}) - \gamma_l(t_j))\|_{\mathfrak{g}} \le K\,\|\gamma_l|_{[t_j,t_{j+1}]}\|_{V^r}^2$

and hence (since $\delta$ is small)

    $d(\gamma(t_{j+1}),\gamma(t_j)) = \|\gamma_l(t_{j+1}) - \gamma_l(t_j)\|_{\mathfrak{g}} + O(\|\gamma_l|_{[t_j,t_{j+1}]}\|_{V^r}^2).$

Estimating $O(\|\gamma_l|_{[t_j,t_{j+1}]}\|_{V^r}^2)$ crudely by $\|\gamma_l\|_{V^r}\,O(\|\gamma_l|_{[t_j,t_{j+1}]}\|_{V^r})$ and taking the $\ell^r$ sum
in the j index, we see that for any partition $a = t_0 < \dots < t_n = b$ we have

    $\Big( \sum_{j=0}^{n-1} d(\gamma(t_{j+1}),\gamma(t_j))^r \Big)^{1/r} = \Big( \sum_{j=0}^{n-1} \|\gamma_l(t_{j+1}) - \gamma_l(t_j)\|_{\mathfrak{g}}^r \Big)^{1/r} + O(\|\gamma_l\|_{V^r}^2).$

Taking suprema over all partitions we obtain the result.

It remains to prove (78) for some suitably large K. This we shall do by an induction
on scale (or "Bellman function") argument. Let us fix the smooth curve $\gamma$. We shall
prove the estimate for all subcurves of $\gamma$, i.e. for all intervals $[t_1,t_2]$ in [a,b], we shall
prove that

(79)    $\|\log(\gamma(t_1)^{-1}\gamma(t_2)) - (\gamma_l(t_2) - \gamma_l(t_1))\|_{\mathfrak{g}} \le K\,\|\gamma_l|_{[t_1,t_2]}\|_{V^r}^2.$

Let us first prove this in the case when the interval $[t_1,t_2]$ is sufficiently short, say of
length at most $\varepsilon$ for some very small $\varepsilon$ (depending on $\gamma$). In that case, we perform a
Taylor expansion to obtain

(80)    $\gamma_l(t) = \gamma_l(t_1) + \gamma_l'(t_1)(t - t_1) + \tfrac{1}{2}\gamma_l''(t_1)(t - t_1)^2 + O_\gamma((t - t_1)^3)$

and

(81)    $\gamma_l'(t) = \gamma_l'(t_1) + \gamma_l''(t_1)(t - t_1) + O_\gamma((t - t_1)^2)$

when $t \in [t_1,t_2]$, and where the $\gamma$ subscript in $O_\gamma$ means that the constants here are
allowed to depend on $\gamma$ (more specifically, on the norm of $\gamma$), and the O() is with
respect to the $\|\cdot\|_{\mathfrak{g}}$ norm. Also we remark that as $\gamma$ is assumed smooth, $\gamma_l'(t_1)$ is bounded
away from zero. It is then an easy matter to conclude that

(82) Il7d II V > lHiti)h\t2 - h\ 

if $\varepsilon$ is sufficiently small depending on $\gamma$. On the other hand, from (72) and (81) we have

    $\gamma'(t) = \gamma(t)\big( \gamma_l'(t_1) + \gamma_l''(t_1)(t - t_1) + O_\gamma((t - t_1)^2) \big)$

from which one may conclude that

    $\gamma(t) = \gamma(t_1) \exp\big( \gamma_l'(t_1)(t - t_1) + \tfrac{1}{2}\gamma_l''(t_1)(t - t_1)^2 + O(\|\gamma_l'(t_1)\|_{\mathfrak{g}}^2 |t - t_1|^2) + O_\gamma((t - t_1)^3) \big)$

for all $t \in [t_1,t_2]$, if $\varepsilon$ is sufficiently small. We rewrite this as

    $\log(\gamma(t_1)^{-1}\gamma(t)) = \gamma_l'(t_1)(t - t_1) + \tfrac{1}{2}\gamma_l''(t_1)(t - t_1)^2 + O(\|\gamma_l'(t_1)\|_{\mathfrak{g}}^2 |t - t_1|^2) + O_\gamma((t - t_1)^3),$

and then specialize to the case $t = t_2$. By (80), we have

    $\log(\gamma(t_1)^{-1}\gamma(t_2)) - (\gamma_l(t_2) - \gamma_l(t_1)) = O(\|\gamma_l'(t_1)\|_{\mathfrak{g}}^2 |t_2 - t_1|^2) + O_\gamma((t_2 - t_1)^3),$

and hence by (82) we have (79) if $t_2 - t_1$ is small enough (depending on $\gamma$) and K is
large enough (independent of $\gamma$).

This proves (79) when the interval $[t_1,t_2]$ is small enough. By (82), it also proves (79)
when $\|\gamma_l|_{[t_1,t_2]}\|_{V^r}$ is sufficiently small. To conclude the proof of (79) in general, we now
assert the following inductive claim: if (79) holds whenever $\|\gamma_l|_{[t_1,t_2]}\|_{V^r} < \varepsilon$ for some
given $0 < \varepsilon < \delta$, then it also holds whenever $\|\gamma_l|_{[t_1,t_2]}\|_{V^r} < 2^{1/r}\varepsilon$, provided that K
is sufficiently large (independent of $\varepsilon$) and $\delta$ is sufficiently small (depending on K, but
independent of $\varepsilon$). Iterating this we will obtain the claim (79) for all intervals $[t_1,t_2]$ in
[a,b].

It remains to prove the inductive claim. Let $[t_1,t_2]$ be any subinterval of [a,b] such
that the quantity $A = \|\gamma_l|_{[t_1,t_2]}\|_{V^r}$ is less than $2^{1/r}\varepsilon$. Applying Lemma C.1 we may
subdivide $[t_1,t_2] = [t_1,t_*] \cup [t_*,t_2]$ such that

    $\|\gamma_l|_{[t_1,t_*]}\|_{V^r},\ \|\gamma_l|_{[t_*,t_2]}\|_{V^r} \le 2^{-1/r} A < \varepsilon < \delta.$

By the inductive hypothesis, we thus have

    $\|\log(\gamma(t_1)^{-1}\gamma(t_*)) - (\gamma_l(t_*) - \gamma_l(t_1))\|_{\mathfrak{g}} \le K\,2^{-2/r} A^2$

and

    $\|\log(\gamma(t_*)^{-1}\gamma(t_2)) - (\gamma_l(t_2) - \gamma_l(t_*))\|_{\mathfrak{g}} \le K\,2^{-2/r} A^2.$

In particular, we have

    $\|\log(\gamma(t_1)^{-1}\gamma(t_*))\|_{\mathfrak{g}} \le \|\gamma_l(t_*) - \gamma_l(t_1)\|_{\mathfrak{g}} + K\,2^{-2/r} A^2$
    $\qquad \le \|\gamma_l|_{[t_1,t_*]}\|_{V^r} + O(K A^2)$
    $\qquad = O(A(1 + KA)) = O(A(1 + K\delta)) = O(A)$

if $\delta$ is sufficiently small depending on K. Similarly we have

    $\|\log(\gamma(t_*)^{-1}\gamma(t_2))\|_{\mathfrak{g}} = O(A)$

and hence by the Baker-Campbell-Hausdorff formula (if $\delta$ is sufficiently small)

    $\|\log(\gamma(t_1)^{-1}\gamma(t_2)) - \log(\gamma(t_1)^{-1}\gamma(t_*)) - \log(\gamma(t_*)^{-1}\gamma(t_2))\|_{\mathfrak{g}} = O(A^2).$

By the triangle inequality, we thus have

    $\|\log(\gamma(t_1)^{-1}\gamma(t_2)) - (\gamma_l(t_2) - \gamma_l(t_1))\|_{\mathfrak{g}} \le 2K\,2^{-2/r} A^2 + O(A^2).$

We now use the hypothesis r < 2, which forces $2 \times 2^{-2/r} < 1$. If K is large enough
(depending on r, but independently of $\delta$, A, or $\varepsilon$) we thus have (79). This closes the
inductive argument. □
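
The way the hypothesis r < 2 closes the induction can be made explicit with a short worked computation. This is only a sketch: the names $\theta$ and $C_0$ are introduced here for illustration, with $C_0$ standing for the implied constant in the $O(A^2)$ error term, which the argument above takes to be independent of K once $\delta$ is small enough depending on K.

```latex
% Notation (ours, for illustration): \theta := 2 \cdot 2^{-2/r} = 2^{1-2/r},
% and C_0 := the implied constant in the O(A^2) error term.
% Since r < 2 we have 1 - 2/r < 0, hence 0 < \theta < 1.
\begin{align*}
\bigl\| \log(\gamma(t_1)^{-1}\gamma(t_2)) - (\gamma_l(t_2) - \gamma_l(t_1)) \bigr\|_{\mathfrak{g}}
  &\le \theta K A^2 + C_0 A^2 = (\theta K + C_0)\, A^2 , \\
\theta K + C_0 \le K
  &\iff K \ge \frac{C_0}{1-\theta} = \frac{C_0}{1 - 2^{1-2/r}} .
\end{align*}
% For any such K the right-hand side is at most K A^2, which is (79).
% At r = 2 one has \theta = 1 and no finite K satisfies the inequality.
```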

Letting w, v be any elements of the Lie algebra $\mathfrak{g}$, one can define a nonlinear Fourier
summation operator associated to G, w, v by means of the left trace

    $\mathcal{NC}[f](k,0) = I$

    $\frac{\partial}{\partial x}\mathcal{NC}[f](k,x) = \mathcal{NC}[f](k,x)\,\big( \mathrm{Re}(e^{-2\pi i k x} f(x))\,w + \mathrm{Im}(e^{-2\pi i k x} f(x))\,v \big)$

or (giving a different operator) by the right trace

    $\mathcal{NC}[f](k,0) = I$

    $\frac{\partial}{\partial x}\mathcal{NC}[f](k,x) = \big( \mathrm{Re}(e^{-2\pi i k x} f(x))\,w + \mathrm{Im}(e^{-2\pi i k x} f(x))\,v \big)\,\mathcal{NC}[f](k,x).$

Above, $k, x \in \mathbb{R}$, $\mathcal{NC}[f]$ takes values in G, I is the identity element of G, and Re, Im are
the real and imaginary parts of a complex number. An example of interest is given by
G = SU(1,1), and

    $w = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad v = \begin{pmatrix} 0 & i \\ -i & 0 \end{pmatrix}.$

Combining Lemma C.2 with the variational Menshov-Paley-Zygmund theorem of the
previous section, we obtain a variational version of the Christ-Kiselev theorem [6].
Namely, we see that for 1 < p < 2 and r > p

ll^|A^C[/]|<l-^C[/]||^p/^^^^^ < Gp^r,G,w,v\\f\\LP(M.) 

and 

||l|A^C[/]|>lA/'C[/]|| ,/ < Gp^r,G,w,v\\f\\LP{R)- 

Note that the usual logarithms are hidden in the d metric we have placed on the Lie
group G.

Extending these estimates to the case p = 2 is an interesting and challenging problem,
even when r = ∞, which would correspond to a nonlinear Carleson theorem. Lemma
C.2 cannot be extended to any exponent r > 2. Sandy Davie and the fifth author of
this paper have an unpublished example of a curve in the Lie group SU(1,1) with trace
in the subspace of su(1,1) of matrices vanishing on the diagonal so that the diameter of
the curve is not controlled by the 2-variation of the trace.

Terry Lyons' machinery [23] via iterated integrals faces an obstruction in a potential
application to a nonlinear Carleson theorem because of the unboundedness results for
the iterated integrals shown in [26].
D. An application to ergodic theory

The study of Wiener-Wintner type theorems is the area of ergodic theory most closely related
to the study of Carleson's operator. In [19], Lacey and Terwilleger prove the following
singular integral variant of the Wiener-Wintner theorem:

Theorem D.1. For 1 < p, all measure preserving flows $\{T_t : t \in \mathbb{R}\}$ on a probability
space $(X,\mu)$ and functions $f \in L^p(\mu)$, there is a set $X_f \subset X$ of probability one, so that
for all $x \in X_f$ we have that the limit

lim/ e^''f{Ttx)^ . 

exists for all 6 £M. 



One idea to approach such convergence results is to study quantitative estimates in
the parameter s that imply convergence, as pioneered by Bourgain's paper [1] in a similar
context. We first need to pass to a mollified variant of the above theorem:

Theorem D.2. Let $\phi$ be a function on $\mathbb{R}$ in the Wiener space, i.e. the Fourier transform
$\hat\phi$ is in $L^1(\mathbb{R})$. For 1 < p, all measure preserving flows $\{T_t : t \in \mathbb{R}\}$ on a probability space
$(X,\mu)$ and functions $f \in L^p(\mu)$, there is a set $X_f \subset X$ of probability one, so that for all
$x \in X_f$ we have that the limits

lim / e'''f{Ttx)4>{st)^ , 
lim / e^<^'f{Ttx)4>ist)^ . 

exist for all $\theta \in \mathbb{R}$.

This theorem clearly follows from an a priori estimate

    $\Big\| \sup_{\theta} \Big\| \int e^{i\theta t} f(T_t x)\,\phi(st)\,\frac{dt}{t} \Big\|_{V^r(s)} \Big\|_{L^p(x)} \le C\,\|f\|_{L^p}$

for r > max(2,p'). Here we have written $V^r(s)$ for the variation norm taken in the
parameter s of the expression inside, and likewise for $L^p(x)$. The variation norm is the
strongest norm widely used in this context, while Lacey and Terwilleger use a weaker
oscillation norm in the proof of their theorem.



By a standard transfer method ([3], [7]) involving replacing f by translates $T_y f$ and
an averaging procedure in y, the a priori estimate can be deduced from an analogous
estimate on the real line

(83)    $\Big\| \sup_{\theta} \Big\| \int e^{i\theta t} f(x+t)\,\phi(st)\,\frac{dt}{t} \Big\|_{V^r(s)} \Big\|_{L^p(x)} \le C\,\|f\|_{L^p}.$

The main purpose of this appendix is to show how this estimate (j83p can be deduced 
from the main theorem of this paper by an averaging argument. We write the V'^{s) 
norm explicitly and expand (j) into a Fourier integral to obtain for the left hand side of 



83|) the expression 



sup sup 

f SO<Sl<---<SK 



K 

E 

fc=i 



l/r 



LP{x) 
Now pulling the integral in $\eta$ out of the various norms and considering only positive $\eta$
(with the case of negative $\eta$ being similar) and defining $\xi_k = \theta + \eta s_k$ we obtain the upper
bound

/ sup / e^(?^-«'=-)V(x + t)^ Y^' ^ ^imidv ■ 

Now applying the variational Carleson estimate and doing the trivial integral in rj bounds 
this term by a constant times ||/||lp. 

Remark D.3. To prove the Lacey-Terwilleger Theorem D.1 from the mollified version,
one may approximate the characteristic functions used as cutoff functions by Wiener
space functions so that the difference is small in norm. Then at least for f in $L^\infty$ one
can show convergence of the limits by an approximation argument, even though one will
not recover the full strength of the quantitative estimate in the Wiener space setting.
The result for f in $L^\infty$ can then be used as a dense subclass result in other spaces,
which can be handled by easier maximal function estimates and further approximation
arguments.

Remark D.4. The classical version of the Wiener-Wintner theorem does not invoke
singular integrals but more classical averages of the type

1/ e^''f{Ttx)dt . 
We note that the same technique as above may be applied to these easier averages. 
A VARIATION NORM CARLESON THEOREM



RICHARD OBERLIN, ANDREAS SEEGER, TERENCE TAO,
CHRISTOPH THIELE, JAMES WRIGHT

Abstract. We strengthen the Carleson-Hunt theorem by proving $L^p$ estimates for
the r-variation of the partial sum operators for Fourier series and integrals, for r >
max{p', 2}. Four appendices are concerned with transference, a variation norm
Menshov-Paley-Zygmund theorem, and applications to nonlinear Fourier transforms and ergodic
theory.



1. Introduction 

For an integrable function f on the circle group $\mathbb{T} = \mathbb{R}/\mathbb{Z}$, and $k \in \mathbb{Z}$, we denote by
$\hat f_k = \int_0^1 f(y) e^{-2\pi i k y}\,dy$ the Fourier coefficients and consider the partial sum operators
$S_n$ for the Fourier series,

(1)    $S_n f(x) = S[f](n,x) = \sum_{k=-n}^{n} \hat f_k\, e^{2\pi i k x};$

here $n \in \mathbb{N}_0 = \{0, 1, 2, \dots\}$. The celebrated theorem by Carleson [1] states that if f is
square integrable then $S_n f$ converges to f almost everywhere. Hunt [13] extended this
result to $L^p(\mathbb{T})$ functions, for 1 < p < ∞, and proved the inequality

(2)    $\big\| \sup_n |S_n f| \big\|_{L^p(\mathbb{T})} \le C\,\|f\|_{L^p(\mathbb{T})}$

for all $f \in L^p(\mathbb{T})$; see also [10], [20], and [12] for other proofs of this fact.

The purpose of this paper is to strengthen the Carleson-Hunt result for $L^p$ functions,
1 < p < ∞, and show that, for r > max{2,p'}, the (strong) r-variation of the sequence
$\{S_n f(x)\}_{n \in \mathbb{N}_0}$ is finite for almost every $x \in [0,1]$. This can be interpreted as a statement
about the rate of convergence. To fix notation, we consider real or complex valued
sequences $\{a_n\}_{n \in \mathbb{N}_0}$ and define their r-variation to be

(3)    $\|a\|_{V^r} = \sup_K\ \sup_{n_0 < \dots < n_K} \Big( \sum_{\ell=1}^{K} |a_{n_\ell} - a_{n_{\ell-1}}|^r \Big)^{1/r}$

where the sup is taken over all K and then over all increasing sequences of nonnegative
integers $n_0 < \dots < n_K$. Note that the variation norms are monotone decreasing in
the parameter r. Next, for a sequence $F = \{F_n\}$ of Lebesgue measurable functions one
defines the r-variation of F at x, sometimes denoted by $V^r F(x)$, as the $V^r$ norm of the
sequence $\{F_n(x)\}$. We denote the r-variation of the sequence $F_n = S_n f$ by $V^r S[f]$. The
variation norms and the r-variation operator can be defined in a similar fashion, if the
index set $\mathbb{N}_0$ is replaced by another subset of $\mathbb{R}$ (often $\mathbb{R}^+$ or $\mathbb{R}$ itself).
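
As an illustration of the definition, the r-variation of a finite sequence can be computed by a simple dynamic program over the last index of the increasing subsequence. The following is a minimal sketch, not part of the paper: the function name `r_variation` and the quadratic-time strategy are our own choices, and the code treats only finite sequences and finite r.

```python
from typing import Sequence


def r_variation(a: Sequence[complex], r: float) -> float:
    """r-variation of a finite sequence: the supremum over increasing index
    chains n_0 < ... < n_K of (sum_l |a[n_l] - a[n_{l-1}]|^r)^(1/r)."""
    if r < 1:
        raise ValueError("r must be at least 1")
    n = len(a)
    if n < 2:
        return 0.0
    # best[j] = largest value of sum |jumps|^r over chains that end at index j
    # (a chain consisting of the single index j contributes 0).
    best = [0.0] * n
    for j in range(1, n):
        best[j] = max(best[i] + abs(a[j] - a[i]) ** r for i in range(j))
    return max(best) ** (1.0 / r)
```

For example, for the oscillating sequence 0, 1, 0, 1 every jump has size 1, so the 1-variation is 3, whereas for the monotone sequence 0, 1, 2, 3 with r = 2 the optimal chain is the single jump from 0 to 3.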

Let r' := r/(r − 1), the conjugate exponent of r.

Theorem 1.1. Suppose r > 2 and r' < p < ∞. Then, for every $f \in L^p(\mathbb{T})$,

(4)    $\|S[f]\|_{L^p(V^r)} \le C_{p,r}\,\|f\|_{L^p}.$

At the endpoint p = r' a restricted weak type result holds; namely, for any $f \in L^{r',1}(\mathbb{T})$,
the function $V^r S[f]$ belongs to $L^{r',\infty}(\mathbb{T})$.

It is immediate that (4) for r < ∞ implies a quantitative form of almost everywhere
convergence of Fourier series, improving over the standard qualitative result utilizing the
weaker r = ∞ inequality and convergence on a dense subclass of functions.

As will be discussed in Section 2, the conditions on the exponents in (4) are sharp.
Moreover, in the endpoint case p = r' the Lorentz space $L^{r',\infty}$ cannot be replaced by a
smaller Lorentz space.

By standard transference arguments (see Appendix A) Theorem 1.1 is implied by a
result on the real line. The partial (inverse) Fourier integral of a Schwartz function f on $\mathbb{R}$
is defined as

    $S[f](\xi,x) = \int_{-\infty}^{\xi} \hat f(\eta)\, e^{2\pi i \eta x}\,d\eta$

where $\hat f(\eta) = \int f(y) e^{-2\pi i y \eta}\,dy$ defines the Fourier transform of f.

Theorem 1.2. Suppose r > 2. Then S extends to a bounded operator $S : L^p \to L^p(V^r)$
for r' < p < ∞. Moreover S maps $L^{r',1}$ boundedly to $L^{r',\infty}(V^r)$.

Note that if in the above definition of the mixed $L^p(V^r)$ spaces we interchange the
order between integration in the x variable and taking the supremum over the choices of
K and the points $\xi_0$ to $\xi_K$ so that these choices become independent of the variable x,
then the estimates corresponding to Theorem 1.2 are weaker; they follow from a square
function inequality of Rubio de Francia [31] for p > 2, see also [30] for a related endpoint
result for p < 2, and [18] for a proof of Rubio de Francia's inequality which is closer to
the methods of this paper.

While the concept of r-variation norm is at least as old as Wiener's 1920s paper on
quadratic variation [35], variational estimates have been pioneered by D. Lépingle ([22])
who proved them for martingales. Simple proofs of Lépingle's result based on jump
inequalities have been given by Pisier and Xu [27] and by Bourgain [1], and applications
to other families of operators in harmonic analysis such as families of averages and
singular integrals have been considered in [1], and the subsequent papers [14], [2], [15]
(cf. the bibliography of [15] for more references). Bourgain [1] used variation norm
estimates (or related oscillation estimates which are intermediate in difficulty between
maximal and variation norm estimates) to prove pointwise convergence results without
previous knowledge that pointwise convergence holds for a dense subclass of functions.
Such dense subclasses of functions, while usually available in the setting of analysis on
Euclidean space, are less abundant in the ergodic theory setting. In Appendix D we
demonstrate the use of Theorem 1.2 in the setting of Wiener-Wintner type theorems
as developed in [19]. We note that the Carleson-Hunt theorem has previously been
generalized by using other norms in place of the variation norm, see for example the use
of oscillation norms in [19], and the norms in [8], [9].

We are also motivated by the fact that variation norms are in certain situations more
stable under nonlinear perturbation than supremum norms. For example one can deduce
bounds for certain r-variational lengths of curves in Lie groups from the corresponding
lengths of the "trace" of the curves in the corresponding Lie algebras, see Appendix C
for definitions and details. What we have in mind is proving Carleson type theorems for
nonlinear perturbations of the Fourier transform as discussed in [25], [26]. Unfortunately
the naive approach fails and the ultimate goal remains unattained since we only know
the correlation between lengths of the trace and the original curve for r < 2, while the
variational Carleson theorem only holds for r > 2. Nonetheless, this method allows one
to see that a variational version of the Christ-Kiselev theorem [6] follows from a
variational Menshov-Paley-Zygmund theorem which we prove in Appendix B. The variational
Carleson inequality can be viewed as an endpoint in this theory.

Our proof of Theorem 1.2 will follow the method of [20] as refined in [12]. Naturally one
has to invoke variation norm results in the setting of individual trees, which is achieved
by adapting D. Lépingle's result ([22]) to the setting of a tree. The authors initially had
a proof of the case p > 2 and r > p of Theorem 1.2 more akin to [20], while improvement
to r > 2 for such p provided a stumbling block. This stumbling block was removed
by better accounting for trees of given energy, as described in the remarks leading to
Proposition 4.3. In Section 3 we reduce the problem to that of bounding certain model
operators which map f to linear combinations of wave-packets associated to collections
of multitiles. In Section 5 we bound the model operators when the collection of multitiles
is of a certain type called a tree; this bound is in terms of two quantities, energy and
density, which are associated to the tree. These quantities are defined in Section 4 and
an algorithm is given to decompose an arbitrary collection of multitiles into a union of
trees with controlled energy and density. Section 6 contains two auxiliary estimates. All
these ingredients are combined to complete the proof in Section 7.

The authors would like to thank the anonymous referee for valuable comments on the 
submitted draft of the paper. 

Some notation. For two quantities A and B let $A \lesssim B$ denote the statement that $A \le
CB$ for some constant C (possibly depending on the parameters p and r). The Lebesgue
measure of a set E is denoted either by |E| or by meas(E). The indicator function of E
is denoted by $1_E$. For a subset $E \subset \mathbb{R}$ and $a \in \mathbb{R}$ we set $a + E = E + a = \{x : x - a \in E\}$.
If I is a finite interval with center c(I) we denote by CI the C-dilate of I with respect
to its center, i.e. the set of all x for which $c(I) + (x - c(I))/C \in I$.

2. Optimality of the exponents 

Since Theorem 1.2 implies Theorem 1.1 (cf. Appendix A) we have to discuss the
optimality only for the Fourier series case. The necessity of the condition r > 2 follows
from a corresponding result for the Cesàro means; its proof by Jones and Wang [16] was
based on a probabilistic result of Qian [29].

We show the necessity of the condition p > r' in Theorem 1.1. Let

    $D_n(x) = \sum_{k=-n}^{n} e^{2\pi i k x} = \frac{\sin((2n+1)\pi x)}{\sin \pi x},$

the Dirichlet kernel, and let $f_N$ be the de la Vallée-Poussin kernel which is defined by
$f_N = 2K_{2N+1} - K_N$ via the Fejér kernel $K_N = (N+1)^{-1} \sum_{j=0}^{N} D_j$. Then $\hat f_N(k) = 1$
for $|k| \le N + 1$ and thus $S_n f_N = D_n$ for $n \le N + 1$. We have $\|f_N\|_{L^1(\mathbb{T})} = O(1)$, and
$\|f_N\|_\infty = O(N)$ and therefore $\|f_N\|_{L^{p,1}(\mathbb{T})} = O(N^{1-1/p})$.
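
The two identities used above, the closed form of the Dirichlet kernel and the fact that the de la Vallée-Poussin kernel has Fourier coefficients equal to 1 for $|k| \le N+1$, can be checked numerically. The following is a small sketch with function names of our own choosing; it relies on the standard fact that the k-th Fourier coefficient of the Fejér kernel $K_N$ is $\max(0, 1 - |k|/(N+1))$.

```python
import cmath
import math


def dirichlet_sum(n: int, x: float) -> float:
    """Dirichlet kernel as the exponential sum over k = -n, ..., n."""
    return sum(cmath.exp(2j * math.pi * k * x) for k in range(-n, n + 1)).real


def dirichlet_closed(n: int, x: float) -> float:
    """Closed form sin((2n+1) pi x) / sin(pi x); x must not be an integer."""
    return math.sin((2 * n + 1) * math.pi * x) / math.sin(math.pi * x)


def fejer_coeff(N: int, k: int) -> float:
    """k-th Fourier coefficient of the Fejer kernel K_N (average of D_0..D_N)."""
    return max(0.0, 1.0 - abs(k) / (N + 1))


def vallee_poussin_coeff(N: int, k: int) -> float:
    """k-th Fourier coefficient of 2 K_{2N+1} - K_N."""
    return 2.0 * fejer_coeff(2 * N + 1, k) - fejer_coeff(N, k)
```

Since the coefficients of $f_N$ equal 1 on the whole range $|k| \le N+1$, the partial sums $S_n f_N$ with $n \le N+1$ reproduce the Dirichlet kernels exactly, which is what the counterexample uses.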

Let N > 10^ and 8N^^ <x<l/8. Let K = K{x) be the largest integer < Nx. Then 
for < A; < 2K{x) there are integers nk{x) < N so that (2nfc(x) + l)x G (| + A;, | + k), 
in particular nfc(x) < nk^i{x) for k < 2K{x) — 1. Observe sin((2n2j(2;) + 1)ttx) > \/2/2 
and sin((2n2j+i(x) + l)7rx) < -^2/2 for < j < K{x) - 1. This gives 

,1/, i^W-i 



( X] l'5n2^+i(x)/^-'5„2^(x)/^r) -( \D., 
j=o j=o 



sin(7rx) 

and this implies for large N 

-^''''\(logiV)i/^ ifp = r'. 

Thus the — >• L^(y) boundedness does not hold for p < r'; moreover the L'^''^ 
L^'''^(y) boundedness does not hold for s < oo. 



3. The model operators 



We shall show in Appendix A how to deduce Theorem 1.1 from Theorem 1.2. To start
the proof of the main Theorem 1.2 we describe some reductions to model operators
involving wave packet decompositions.

First, by interpolation it suffices to prove for $p \ge r'$ the restricted weak type $L^{p,1} \to
L^{p,\infty}(V^r)$ bound. Next, by the monotone convergence theorem it suffices to estimate the
$L^{p,\infty}(V^r)$ norm on finite x-intervals [−A, A], with constant independent of A. By another
application of the monotone convergence theorem it suffices, for any fixed K, to prove
the $L^{p,1} \to L^{p,\infty}([-A,A])$ bound for

(5)    $\sup \Big( \sum_{\ell=1}^{K} |Sf(\xi_\ell,x) - Sf(\xi_{\ell-1},x)|^r \Big)^{1/r}$

where the sup is taken over all $(\xi_0, \dots, \xi_K)$ with $\xi_{\ell-1} \le \xi_\ell$ for ℓ = 1, ..., K. Moreover,
by the density of Schwartz functions in $L^{p,1}$ it suffices to prove a uniform estimate for
all Schwartz functions. Note that for any Schwartz function f the expression $S[f](\xi,x)$
depends continuously on $(\xi,x)$. Therefore it suffices to bound the expression analogous
to (5) where we impose the strict inequality $\xi_{\ell-1} < \xi_\ell$ for ℓ = 1, ..., K. Moreover, by the
continuity it suffices for each finite set $H \subset \mathbb{R}$ to prove bounds for this expression under
the assumption that the $\xi_\ell$ belong to H, and we may also assume that H does not contain
any numbers of the form $n2^m$ with $m, n \in \mathbb{Z}$ (i.e. no endpoints of dyadic intervals).

We may now linearize the variation norm. Fix $K \in \mathbb{N}$, measurable real valued functions
$\xi_0(x) < \dots < \xi_K(x)$, with values in H, and measurable complex valued functions $a_1(x)$,
..., $a_K(x)$ satisfying

    $|a_1(x)|^{r'} + \dots + |a_K(x)|^{r'} = 1.$

Let

    $S'[f](x) = \sum_{k=1}^{K} \big( S[f](\xi_k(x),x) - S[f](\xi_{k-1}(x),x) \big)\, a_k(x).$

Theorem 1.2 will now follow from the estimate

(6)    $\|S'[f]\|_{L^{p,\infty}(\mathbb{R})} \le C\,\|f\|_{L^{p,1}(\mathbb{R})}$

where C is independent of H and the linearizing functions, and where f is any
Schwartz function. Finally, in order to prove (6) for any fixed H we may assume that $\hat f$
has compact support in $\mathbb{R} \setminus H$ since the space of Schwartz functions with this property
is dense in $L^{p,1}$, 1 < p < ∞.

Let V = {[2*^m, 2''(m + 1)) : m,k G Z} be the set of dyadic intervals. A tile will be 
any rectangle I x to where I,uj are dyadic intervals, and \I\\uj\ = 1/2. We will write S' as 
the sum of wave packets adapted to tiles, and then decompose the operator into a finite 
sum of model operators by sorting the wave packets into a finite number of classes. For 
each k, 

To suitably express the difference above as a sum of wave packets, we will first need 
to construct a partition of adapted to certain dyadic intervals. The fact that 

has two boundary points instead of the one from (— oo,^fc) will necessitate a 
slightly more involved discretization argument than that in |20| . 

For any ^ < let J^^g/ be the set of maximal dyadic intervals J such that J C 
and dist(J,0 > |J|, dist(J,^') > |J|. Let 1/ be a C°° function from M to [0,1] which 
vanishes on (— 00,— 10~^], is identically equal to 1 on [10~^,oo), and so that i^'ix) > 
for — 10~^ < X < 10~^. Given an interval J = [a,b), and i £ {—1,0, 1}, define 

Thus if c(J) = the center of J, then 

(7) ^jAO = M^-^) where ^rj) = u{2~\^ + \)) - u{r^ - \) , 

and we notice for i E {—1,0, 1} both and are C°° functions supported in [— ^, ^] 
(more precisely in [— ^])- Hence ifj^i is supported on a ||-dilate of J with respect 
to its center. 

For each $J \in \mathcal{J}_{\xi,\xi'}$, one may check that there is a unique interval $J' \in \mathcal{J}_{\xi,\xi'}$ which lies
strictly to the left of J and satisfies dist(J', J) = 0, and one may check that J' has size
|J|/2, |J|, or 2|J|. We define $\psi_J = \psi_{J,i(J)}$ where i(J) is chosen so that $|J'| = 2^{i(J)}|J|$.

Then 

(8) hu')i^)m= E ^mKv)- 

Since we assume that / is compactly supported in M \ H we see that that for every pair 
^ < ^' with ^, ^' S H only a finite number of dyadic intervals J G J^^^' are relevant in 

We now write each multiplier ipj as the sum of wave packets. For every tile P = I x J, 
define 

Mx) = \I\'^'j'-'[V^{x-c{I)) 

where c(/) is the center of / and J-~^ denotes the inverse Fourier transform. For each 
J, we then have 

(9) E {f^^Ixj)4>IxJ = f^j- 

l^l=l/(2|J|) 

To see this we use a Fourier series expansion (c/. |34j). We first observe that 4>p{^) = 
\/RT V (pj{^)e~'^'^^^^^^^ and use {f,(f>p) = (/,^p). Now let parametrize the centers of the 
dyadic intervals / of length L by -{k-\)L, k G Z. Set gj{uj) := [^/^/](c(J)+L-ia;)e™ 



2 

and note that 5j is supported in [— ^n, -M]- The left hand side of ([9]) is equal to 



7,. ^ 



2m{kL-^)e, 



, [-1/2 

which gives ([9]). This in turn yields the representation of S'[f] in terms of wave packets: 

K 

(10) s'[f]{x) = j2{ E E {f^^ixj)^ixj)ak{x). 

k=l JGJe,_i(x),€fe(.) |/I = (1/{2|J|)) 



For the function / under consideration the above Fourier series expansion converges 
in L^-Sobolev spaces of arbitrary high order and thus the convergence in pop is uniform 
for X G [— Therefore it suffices, for any finite family *p of tiles, to consider the 
operator 5" defined by 

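(11) $S''[f](x) = \sum_{k=1}^{K}\Big(\sum_{\substack{(I,J)\in\mathbf{P}\\ J\in\mathcal{J}_{\xi_{k-1}(x),\xi_k(x)}}} \langle f,\phi_{I\times J}\rangle\,\phi_{I\times J}(x)\Big)\,a_k(x).$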



The wave packets will be sorted into a finite number of classes, each well suited for further analysis. Sorting is accomplished by dividing every $\mathcal{J}_{\xi,\xi'}$ into a finite number of disjoint sets. These sets will be indexed by a fixed subset of $\{1,2,3\} \times \{1,2,3,4\}^2 \times \{\text{left},\text{right}\}$. Specifically, for each $(m,n,\text{side}) \in \{1,2,3,4\}^2 \times \{\text{left},\text{right}\}$, we define

• $\mathcal{J}_{\xi,\xi',(1,m,n,\text{side})} = \{J \in \mathcal{D} : J \subset (\xi,\xi')$, $\xi$ is in the interval $J - (m+1)|J|$, $\xi'$ is in the interval $J + (n+1)|J|$, and $J$ is the side-child of its dyadic parent$\}$.

• $\mathcal{J}_{\xi,\xi',(2,m,n,\text{side})} = \{J \in \mathcal{D} : J \subset (\xi,\xi')$, $\xi$ is in the interval $J - (m+1)|J|$, $\mathrm{dist}(\xi',J) \ge n|J|$, and $J$ is the side-child of its dyadic parent$\}$.

• $\mathcal{J}_{\xi,\xi',(3,m,n,\text{side})} = \{J \in \mathcal{D} : J \subset (\xi,\xi')$, $\mathrm{dist}(\xi,J) \ge m|J|$, $\xi'$ is in the interval $J + (n+1)|J|$, and $J$ is the side-child of its dyadic parent$\}$.

We will choose $R \subset \{1,2,3\} \times \{1,2,3,4\}^2 \times \{\text{left},\text{right}\}$ so that for each $\xi < \xi'$ the collection $\{\mathcal{J}_{\xi,\xi',\rho}\}_{\rho\in R}$ is pairwise disjoint and $\mathcal{J}_{\xi,\xi'} = \bigcup_{\rho\in R}\mathcal{J}_{\xi,\xi',\rho}$. We will also assume that for each $\rho \in R$ there is an $i(\rho) \in \{-1,0,1\}$ such that $|J'| = 2^{i(\rho)}|J|$ for every $\xi < \xi'$, $J \in \mathcal{J}_{\xi,\xi',\rho}$ and $J' \in \mathcal{J}_{\xi,\xi'}$ with $J'$ strictly to the left of $J$ and $\mathrm{dist}(J,J') = 0$. One may check that these conditions are satisfied, say, for

R = {(1, 2, 1, left), (1, 2, 2, left), (1, 3, 1, left), 

(1, 3, 2, left), (2, 1, 1, left), (2, 1, 1, right), (2, 2, 1, right), 

(3, 4, 1, left), (3, 3, 1, right), (3, 4, 2, left)}. 

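It now follows that

$$S''[f] = \sum_{\rho\in R} S^\rho[f]$$

where

$$S^\rho[f](x) = \sum_{k=1}^{K}\Big(\sum_{\substack{(I,J)\in\mathbf{P}\\ J\in\mathcal{J}_{\xi_{k-1}(x),\xi_k(x),\rho}}} \langle f,\phi_{I\times J}\rangle\,\phi_{I\times J}(x)\Big)\,a_k(x).$$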



It will be convenient to rewrite each operator $S^\rho$ in terms of multitiles. A multitile will be a subset of $\mathbb{R}^2$ of the form $I \times \omega$ where $I \in \mathcal{D}$ and where $\omega$ is the union of three intervals $\omega_l$, $\omega_u$, $\omega_h$ in $\mathcal{D}$. For each $\rho = (i,m,n,\text{side}) \in R$, we consider a set of $\rho$-multitiles which is parameterized by $\{(I,\omega_u) : I,\omega_u \in \mathcal{D},\ |I||\omega_u| = 1/2$, and $\omega_u$ is the side-child of its parent$\}$. Specifically, given $\omega_u = [a,b)$:

• If $\rho = (1,m,n,\text{side})$ then $\omega_l = \omega_u - (m+1)|\omega_u|$ and $\omega_h = \omega_u + (n+1)|\omega_u|$.

• If $\rho = (2,m,n,\text{side})$ then $\omega_l = \omega_u - (m+1)|\omega_u|$ and $\omega_h = [a + (n+1)|\omega_u|, \infty)$.

• If $\rho = (3,m,n,\text{side})$ then $\omega_l = (-\infty, b - (m+1)|\omega_u|)$ and $\omega_h = \omega_u + (n+1)|\omega_u|$.

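For $i = 1,2,3$ we shall say that $\rho$ is an $i$-index if $\rho = (i,m,n,\text{side})$. For every $\rho$-multitile $P$, let $a_P(x) = a_k(x)$ if $k$ satisfies $1 \le k \le K$ and $\xi_{k-1}(x) \in \omega_l$ and $\xi_k(x) \in \omega_h$ (such a $k$ would clearly be unique), and $a_P(x) = 0$ if there is no such $k$. Then,

$$S^\rho[f](x) = \sum_{P\in\mathbf{P}_\rho} \langle f,\phi_P\rangle\,\phi_P(x)\,a_P(x)$$

where, for each $\rho$-multitile $P$, $\phi_P(x) = \sqrt{|I|}\,\mathcal{F}^{-1}[\sqrt{\psi_{\omega_u,i(\rho)}}](x - c(I))$ and $\mathbf{P}_\rho$ denotes the set of all $\rho$-multitiles for which $I \times \omega_u$ belongs to $\mathbf{P}$.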
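For a fixed point $x$, the coefficient $a_P(x)$ amounts to a lookup of the unique index $k$ whose consecutive frequencies straddle the multitile. A minimal Python sketch, assuming the values $\xi_0(x) \le \dots \le \xi_K(x)$ and $a_1(x),\dots,a_K(x)$ are given as lists and that intervals are represented as half-open pairs; the function name and the data layout are hypothetical and chosen only for illustration.

```python
import math

def multitile_coefficient(xis, a, omega_l, omega_h):
    """Return a_k for the unique k in 1..K with xis[k-1] in omega_l and
    xis[k] in omega_h, or 0 if no such k exists.
    xis = [xi_0, ..., xi_K] is nondecreasing, a = [a_1, ..., a_K];
    omega_l and omega_h are half-open intervals (lo, hi), and omega_l is
    assumed to lie entirely to the left of omega_h."""
    def contains(interval, point):
        lo, hi = interval
        return lo <= point < hi

    matches = [k for k in range(1, len(xis))
               if contains(omega_l, xis[k - 1]) and contains(omega_h, xis[k])]
    # Uniqueness: two matches k < k' would force a point of omega_h
    # (namely xis[k]) to be <= a point of omega_l (namely xis[k'-1]).
    assert len(matches) <= 1
    return a[matches[0] - 1] if matches else 0

# omega_h is a half-line here, as for a 2-index.
value = multitile_coefficient([-3.5, 2.5, 7.0], [0.5, -1.0],
                              (-4.0, -3.0), (2.0, math.inf))
```

In the example, $\xi_0 = -3.5$ lies in $\omega_l = [-4,-3)$ and $\xi_1 = 2.5$ lies in $\omega_h = [2,\infty)$, so the lookup selects $k = 1$.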

Inequality Q and hence Theorem 1 1 . 2 1 will then follow after proving the bound 

(12) $\|S^\rho[f]\|_{L^{p,\infty}} \lesssim \|f\|_{L^{p,1}}$

for each p £ R. We shall only give the proof of this estimate for the case that p is a 
1-index or p is a 2-index, and the case where p is a 3-index can be deduced by symmetry 
considerations. Indeed, if P = {I,u)u) ■= ([^J, ^)5 [c, d)) and P := ([— 6, a), [— d, c)) then 
{f,(pp)(pp = (/( — )-,4>p)4>p where, in the definition of the 0p the function Vi in ([7])
is replaced with •) (both are supported in (— ^i^))- Now reflection sends a half 
open interval [a, 6) to a half open interval (—6, —a], however this plays no role in our 
symmetry argument if, as we do, we assume that the set H does not contain endpoints 
of dyadic intervals. We then see that the estimation of S^lf] for p = {3,m,n, side) is 
equivalent to the estimation of a 5'^[/(— •)] where the corresponding set {P} of multitiles 
is replaced with a set {P} of index p = (2, n, m, opposite side) and the set H is replaced 
with : — ^ G H}. Beginning with (jl4p both or fj(— •) are allowed in the definition of 
the functions ifj^i and (pp. 

By the usual characterization of $L^{p,1}$ as superpositions of functions bounded by characteristic functions it suffices to show that

$$\mathrm{meas}(\{x : |S^\rho[f](x)| > \lambda\}) \le C^p\lambda^{-p}|F|$$

where $F \subset \mathbb{R}$, $|F| > 0$, $|f| \le 1_F$, $\lambda > 0$, $2 < r < \infty$, and $r' < p < (1/2 - 1/r)^{-1}$. This is accomplished by proving that for every measurable $E \subset \mathbb{R}$,

(13) $\mathrm{meas}(\{x \in E : |S^\rho[f](x)| > C(|F|/|E|)^{1/p}\}) \le |E|/2.$

Indeed, if we set $E_\lambda = \{x : |S^\rho[f](x)| > \lambda\}$ then by the finiteness of the set of tiles under consideration the set $E_\lambda$ has a priori finite measure. If $|E_\lambda| \le C^p\lambda^{-p}|F|$ then there is nothing to prove. If the opposite inequality $|E_\lambda| > C^p\lambda^{-p}|F|$ were true then $\lambda > C(|F|/|E_\lambda|)^{1/p}$ and inequality (13) applied to $E = E_\lambda$ would yield that $|E_\lambda| \le |E_\lambda|/2$, a contradiction.

We finally note that, after possibly rescaling, we may assume that $1 \le |E| \le 2$ in (13). The next four sections will be devoted to the proof of inequality (13) in this case.



4. Energy and density 

Recall that $S^\rho[f](x) = \sum_P \langle f,\phi_P\rangle\,\phi_P(x)\,a_P(x)$ where $P$ ranges over an arbitrary finite collection of $\rho$-multitiles, $\rho$ is a 1- or 2-index. It is our goal to show (13) and for this and the next chapter we fix the function $f$ with $|f| \le 1_F$ and the set $E$.

Fix $1 < C_3 < C_2 < C_1$, with $C_2 \in \mathbb{N}$, such that for every multitile $P$,

$$\mathrm{supp}(\hat\phi_P) \subset C_3\omega_u,$$

$$C_2\omega_u \cap C_2\omega_l = \emptyset, \qquad C_2\omega_u \cap \omega_h = \emptyset,$$

$$C_2\omega_l \subset C_1\omega_u, \qquad C_2\omega_u \subset C_1\omega_l;$$

recall that dilations of finite intervals are with respect to their center. One may check that the values $C_3 = 11/10$, $C_2 = 2$, and $C_1 = 12$ satisfy all these properties.
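The disjointness and containment conditions on $C_1$ and $C_2$ can be checked mechanically. The sketch below is an illustration under stated assumptions: it normalises $\omega_u = [0,1)$ (the conditions are invariant under translation and scaling), treats intervals as half-open pairs, and runs through the $(i,m,n)$ values of the 1- and 2-indices in the set $R$ of Section 3. It does not check the support condition involving $C_3$, which depends on the cutoff function $\nu$. All helper names are hypothetical.

```python
from fractions import Fraction

def dilate(interval, factor):
    """Dilate a finite half-open interval [a, b) about its center."""
    a, b = interval
    center, half = (a + b) / 2, (b - a) * factor / 2
    return (center - half, center + half)

def disjoint(i1, i2):
    return i1[1] <= i2[0] or i2[1] <= i1[0]

def contained(inner, outer):
    return outer[0] <= inner[0] and inner[1] <= outer[1]

C1, C2 = Fraction(12), Fraction(2)
INF = float("inf")
# (i, m, n) for the 1- and 2-indices appearing in the set R
indices = [(1, 2, 1), (1, 2, 2), (1, 3, 1), (1, 3, 2), (2, 1, 1), (2, 2, 1)]

def constants_ok(i, m, n):
    w_u = (Fraction(0), Fraction(1))
    w_l = (w_u[0] - (m + 1), w_u[1] - (m + 1))
    if i == 1:
        w_h = (w_u[0] + (n + 1), w_u[1] + (n + 1))
    else:  # 2-index: omega_h is a half-line
        w_h = (w_u[0] + (n + 1), INF)
    return (disjoint(dilate(w_u, C2), dilate(w_l, C2))
            and disjoint(dilate(w_u, C2), w_h)
            and contained(dilate(w_l, C2), dilate(w_u, C1))
            and contained(dilate(w_u, C2), dilate(w_l, C1)))

all_ok = all(constants_ok(*idx) for idx in indices)
```

The tightest case is $m = 1$ for a 2-index, where $C_2\omega_u = [-1/2, 3/2)$ and $C_2\omega_l = [-5/2, -1/2)$ touch without overlapping.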

The wave packet $\phi_P$ is adapted to the multitile $P$. As $\hat\phi_P$ is compactly supported (in $C_3\omega_u$) the function $\phi_P$ cannot have compact support, but as a replacement we have the following bounds involving

$$w_I(x) := \frac{1}{|I|}\Big(1 + \frac{|x - c(I)|}{|I|}\Big)^{-N}$$

for a fixed large $N \gg 10$; namely

(14) $\Big|\frac{d^n}{dx^n}\big(\exp(-2\pi i\,c(\omega_u)\,\cdot)\,\phi_P\big)(x)\Big| \le C'(n)\,|I|^{(1/2)-n}\,w_I(x)$

for each $n \ge 0$.

We are working with a given finite set of $\rho$-multitiles $\mathbf{P}$ (with $\rho$ a 1- or 2-index) and we let $M_0 = M_0(\mathbf{P})$ be the smallest integer $M$ for which all tiles are contained in the square $[-2^M, 2^M]^2$. Throughout this paper we fix

$$\Xi^{\mathrm{top}} = \{\eta : |\eta| \le C_1 2^{M_0+10},\ \eta = n2^{-M_0-10} \text{ for some } n \in \mathbb{Z}\}$$

as the set of admissible top-frequencies for trees, as in the following definition.



Definition. Consider a triple $\mathcal{T} = (T_{\mathcal{T}}, I_{\mathcal{T}}, \xi_{\mathcal{T}})$ with a set of multitiles $T_{\mathcal{T}}$, a dyadic interval $I_{\mathcal{T}} \subset [-2^{M_0}, 2^{M_0})$ and a point $\xi_{\mathcal{T}} \in \Xi^{\mathrm{top}}$. We say that $\mathcal{T}$ is a tree if the following properties are satisfied.

(i) $I \subset I_{\mathcal{T}}$ for all $P = (I,\omega_u) \in T_{\mathcal{T}}$.

(ii) If $P = (I,\omega_u)$ and $\omega_m$ denotes the convex hull of $C_2\omega_u \cup C_2\omega_l$ then

$$\omega_{\mathcal{T}} := \Big[\xi_{\mathcal{T}} - \frac{C_2-1}{4|I_{\mathcal{T}}|},\ \xi_{\mathcal{T}} + \frac{C_2-1}{4|I_{\mathcal{T}}|}\Big)$$

is contained in $\omega_m$.

We refer to $I_{\mathcal{T}}$ as the top interval of the tree, and to $\xi_{\mathcal{T}}$ as the top frequency of the tree.

In order not to overload the notation we usually refer to the set $T$ as "the" tree (keeping in mind that it carries additional information of a top frequency and a top interval), and we shall also use the notation $I_T$, $\xi_T$ and $\omega_T$ in place of $I_{\mathcal{T}}$, $\xi_{\mathcal{T}}$ and $\omega_{\mathcal{T}}$. With this convention we also define

Definition.

(i) A tree $(T, I_T, \xi_T)$ is $l$-overlapping if $\xi_T \in C_2\omega_l$ for every $P \in T$.
(ii) A tree $(T, I_T, \xi_T)$ is $l$-lacunary if $\xi_T \notin C_2\omega_l$ for every $P \in T$.

Notice that the union of two trees with the top data $I_T$, $\xi_T$ is again a tree with the same top data. Also, the union of two $l$-overlapping trees with the same top data is again an $l$-overlapping tree with the same top data.

We split our finite collection of multitiles into a bounded number of subcollections satisfying certain separation conditions (i.e. henceforth all multitiles will be assumed to belong to a fixed subcollection).

Separation assumptions. 

(15) If P,P' satisfy \uj'J < \uju\, then \uj'J < —— — |a;„|. 

(16) If $P, P'$ satisfy $C_1\omega_u \cap C_1\omega'_u \ne \emptyset$ and $|\omega_u| = |\omega'_u|$ then $\omega_u = \omega'_u$.

An immediate but important consequence of the separation assumptions is the frequently used

Observation 4.1. Let $T$ be a tree satisfying the separation properties (15) and (16).

Then the following properties hold.

(i) If $T$ is an $l$-overlapping tree, $P, P' \in T$, and $|\omega'_u| < |\omega_u|$ then $C_3\omega_u \cap C_3\omega'_u = \emptyset$.
(ii) If $T$ is an $l$-lacunary tree, $P, P' \in T$, and $|\omega'_u| < |\omega_u|$ then $C_3\omega_l \cap C_3\omega'_l = \emptyset$.
(iii) If $P, P' \in T$, $P \ne P'$, and $|\omega_u| = |\omega'_u|$, then $I \cap I' = \emptyset$.



As in previous proofs of Carleson's theorem (in particular [20]) we shall split the set of multitiles into subsets with controllable energy and density associated to the function $f$ and the set $E$, respectively. Here we work with the following definitions.

Definition. Fix $f \in L^2(\mathbb{R})$ and a measurable set $E \subset \mathbb{R}$. Given any collection of multitiles $\mathbf{P}$ we define

$$\mathrm{energy}(\mathbf{P}) = \sup_T \Big(\frac{1}{|I_T|}\sum_{P\in T} |\langle f,\phi_P\rangle|^2\Big)^{1/2}$$

where the sup ranges over all $l$-overlapping trees $T \subset \mathbf{P}$.
Given a measurable set E cM. we set 



/I /■ / \x - c(It)\\-'^-J^ I 
density(P) = sup (l + J \!^) Yl l"'^^^)!'' l-rfe-iC^)) dx\ 



k=l 

where the sup is over all non-empty trees T C P. 

Remark. Concerning the terminology, one can argue that the square root should be omitted in the definition of an energy. However we work with the above definition to conform to [20] and other papers in time-frequency analysis.



Lemma 4.2. Let $|f|$ be bounded by $1_F$. For any family $\mathbf{P}$ of multitiles the density of $\mathbf{P}$ (with respect to the set $E$) and the energy (with respect to $f$) are bounded by a universal constant.



Proof. Clearly the density is bounded by f^{l + dx < 3. Concerning the energy 

bound we let T be any /-overlapping tree, and split f = f + f" where /' = Is/^/- 
We estimate Eperlif ,<Pp)\'' < ll/'IU^ (Eper K/'> 'Ap)!')'/', by the Cauchy-Schwarz 
inequality. Now use that the supports of the (pp are disjoint for different sizes of frequency 
intervals, and then, for a fixed size, use the bounds (jl4p (for n = 0) to see that 

2 ^1/2 ^ ^ ^ ^ ^\l/2 



L2 



Per jez Per pgt 

Hence, 

Per 
Furthermore, since $|f''| \le 1_{\mathbb{R}\setminus 3I_T}$ we have the estimate

\{f",M\ < 1/1^/^(1 + dist(/,M\3/T)/|/|)"(^"') < \I\'/\\I\/\lT\f"' 
Summing in P, we obtain 

Per 

Combining the estimates for /' and /" we see that X^pst K/' ~ 1-^^"! for every 
Z-overlapping tree and it follows that the energy of P with respect to / is bounded above 
by a universal constant. □ 



The following proposition allows one to decompose an arbitrary collection of multitiles 
into the union of trees, where the trees are divided into collections Tj with the energy 
of trees from Tj bounded by . The control over energy is balanced by an L'^ bound 
for the functions Nj^£ := X^^^gt It' contrast to [20] and [12], it is necessary here 
to consider q > 1 and ^ > in order to effectively use the tree estimate Proposition 15.1 
with q > 1. Note that such L*^ bound for Nj^i is established by combining (jl7p and (jlSp 
below. The bound ([20]) permits one to make further decompositions to take advantage 
of large \F\ in the L"? bound for the Nj^£ while maintaining compatibility with bounds 
for trees with a fixed density obtained from Proposition 



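Proposition 4.3. Let $\mathcal{E} > 0$, let $|f|$ be bounded above by $1_F$ and let $\mathbf{P}$ be a collection of multitiles with energy bounded above by $\mathcal{E}$. Then, there is a collection of trees $\mathbf{T}$ such that

(17) $\sum_{T\in\mathbf{T}} |I_T| \lesssim \mathcal{E}^{-2}|F|$

and

$$\mathrm{energy}\Big(\mathbf{P}\setminus\bigcup_{T\in\mathbf{T}} T\Big) \le \mathcal{E}/2$$

and such that, for every integer $\ell \ge 0$,

(18) $\Big\|\sum_{T\in\mathbf{T}} 1_{2^\ell I_T}\Big\|_{BMO} \lesssim 2^{2\ell}\mathcal{E}^{-2}.$

Furthermore, if for some collection of trees $\mathbf{T}'$,

(19) $\mathbf{P} = \bigcup_{T'\in\mathbf{T}'} T'$

then

(20) $\sum_{T\in\mathbf{T}} |I_T| \lesssim \sum_{T'\in\mathbf{T}'} |I_{T'}|.$

Above, and subsequently, $\|\cdot\|_{BMO}$ denotes the dyadic BMO norm.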



Proof. We select trees through an iterative procedure. First, if $\mathrm{energy}(\mathbf{P}) \le \mathcal{E}/2$ then no tree is chosen and $\mathbf{T} = \emptyset$.

If $\mathrm{energy}(\mathbf{P}) > \mathcal{E}/2$ then we observe that there is an $l$-overlapping tree $S \subset \mathbf{P}$ for which

(21) $\frac{1}{|I_S|}\sum_{P\in S} |\langle f,\phi_P\rangle|^2 \ge \mathcal{E}^2/4.$

There is only a finite number of such trees and as $S_1$ we choose one for which the top datum $\xi_S$ is maximal (in $\mathbb{R}$). Note that the maximality can be achieved as we restrict all top frequency data to the finite set $\Xi^{\mathrm{top}}$. Let $T_1$ be the tree in $\mathbf{P}$ which has top data $(\xi_{S_1}, I_{S_1})$ and which is maximal with respect to inclusion.

Suppose that trees $S_k$, $T_k$ have been chosen for $k = 1,\dots,j$. Set

$$\mathbf{P}_j = \mathbf{P}\setminus\bigcup_{k=1}^{j} T_k.$$

If $\mathrm{energy}(\mathbf{P}_j) \le \mathcal{E}/2$ then we terminate the procedure, set $\mathbf{T} = \{T_k\}_{1\le k\le j}$ and $n = j$. Otherwise, we may find an $l$-overlapping tree $S \subset \mathbf{P}_j$ such that (21) holds. Among the $l$-overlapping trees in $\mathbf{P}_j$ satisfying (21) choose one with maximal top-frequency (in $\mathbb{R}$) and label this tree $S_{j+1}$. Let $T_{j+1}$ be the tree in $\mathbf{P}_j$ which has top data $(\xi_{S_{j+1}}, I_{S_{j+1}})$ and which is maximal with respect to inclusion. This process will eventually stop since each $T_j$ is nonempty and $\mathbf{P}$ is finite.
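The selection procedure has the shape of a greedy loop. The following Python sketch is a schematic rendering only, under stated assumptions: tiles are opaque labels, the callables `candidate_trees` and `maximal_tree` are hypothetical stand-ins for the geometric notions of $l$-overlapping tree and maximal tree with given top data, and the threshold test uses the non-strict form of (21).

```python
def select_trees(tiles, coeff_sq, candidate_trees, maximal_tree, energy_bound):
    """Greedy selection loop.
    tiles: finite iterable of hashable tile labels.
    coeff_sq[p]: the value |<f, phi_P>|^2 for tile p.
    candidate_trees(remaining): iterable of (tile_set, top_length, top_freq),
        standing in for the l-overlapping trees contained in `remaining`.
    maximal_tree(remaining, tree): tiles of the maximal tree in `remaining`
        with the same top data as `tree`.
    Returns the list of pairs (S_j, T_j) and the set of leftover tiles."""
    remaining = set(tiles)
    chosen = []
    while True:
        heavy = [t for t in candidate_trees(remaining)
                 if t[0] and sum(coeff_sq[p] for p in t[0])
                 >= energy_bound ** 2 / 4 * t[1]]
        if not heavy:
            break  # no candidate tree satisfies the threshold test any more
        s = max(heavy, key=lambda t: t[2])  # maximal top frequency
        t_max = set(maximal_tree(remaining, s)) | set(s[0])
        chosen.append((s, t_max))
        remaining -= t_max  # strictly shrinks, so the loop terminates
    return chosen, remaining

# Toy run: every tile is its own tree with top length 1 and top frequency p.
coeff_sq = {0: 1.0, 1: 0.1, 2: 1.0}
singletons = lambda rem: [(frozenset([p]), 1.0, float(p)) for p in rem]
chosen, left = select_trees(coeff_sq, coeff_sq, singletons,
                            lambda rem, t: t[0], 1.0)
```

In the toy run the tiles 2 and 0 pass the threshold and are removed in order of decreasing top frequency, while tile 1 is left over. The ordering by top frequency is what the proof of (17) later exploits when comparing trees selected at different stages.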



Proof of ( (j7| j. It suffices to show 



(22) 

Since the Sj satisfy ([2T]). we have 



\F\ 



5^1/5,1) <16|F|-2(J] J] K/,0p)| 



i=i PeSj 



Now 



i=i PeSj 



j=l PeSj 



< 



Y.^{f,4>p)<t>p 

j = l P£S, 



n n 



^i^iEEE E \u,m\{f,<^p')mp,4>p')\ 

j=l Pg5j k=l P'&Sj 

where in the last inequality we used |/| < Ip. By symmetry, it remains, for (|22p to show 
that 

n n n 

(23) EEE E \u^<t>p)\\{f,ct^p')\\{4>pAp')\<s^Y.\^s,\ 

j=ik=iPeSj p'eSk.\i'\=\i\ i=i 

and 

(24) EEE E \{f^M\\{f,^p')\mp,^p')\<£'j2\^s,\. 

j=l k=l PeSj P'GSfe:|/'|<|/| j = l 

In both cases, we will use the estimate 



(25) 



l(0P,'/'P')| < [^y^\wi,lr 
which holds whenever $|I'| \le |I|$.

Estimating the product of two terms by the square of their maximum, we see that the 
left side of (IH is 



^2EEE E \if^<pp)\'\i^p^^p')\- 

j=i k=i PeSj p'eSk.\r\=\i\ 

Recall that {4>p,(l)'p) = unless Caw^ n CsLoi, / 0. Thus, by ([16]), ([25]) and the fact that 
the Sk are pairwise disjoint, we can estimate the last display by 

^EEk/''^^)!' E <EEk/'^^)i'^E^V5,i. 

j=lPeSj I'-\I'\ = \I\ j=lP&Sj j=l 

This finishes the proof of (|23p. 

Applying the Cauchy-Schwarz inequality, we see that the left side of (|24p is bounded 

by 

E( E \if^^p)\'f\ E (E E \{f^^p')mp^^p')\)T'- 

j=i PGSj p&Sj k=ip'eSk.\P\<\i\ 

Twice using the fact that the energy of P is bounded by £, we see that the last display 
is 

2\ 1/2 



|i/2. 



2 



i = l P&Sj fe=l P'g5fc:|/'|<|/| 

Thus, to prove (j24|) it remains to show that, for each j, 

(26) E(E E \{<^pMp'r^p')\y <\is, 

PdSj fc=lP'e5fe:|/'|<|/| 

Again, we only have \{(j)p, \ Ipi\^^'^4>p')\ nonzero when C^Uu n C^oj'^ ^ which can only 
happen if swpC^oJu S C'^uj'^ or inf Csw^ G C^uj'^. Applying (p5]) . we thus see that the left 
side of (j26p is dominated by a constant times the expression 



Ei^H(E E (-/^P)) +Ei^H(E E (-^'1 

P&Sj k=l P'65fc;|/'|<|/| P(^Sj k=l P'GSfc:|/'|<|/| 

sup C3U}u£C3U}'^ inf C3Ll)u<^C3Uj'^ 

We now claim that, for each P G Sj (with time interval I), 



(27) (Y, E (^/ap))'<(^/,lM\/s.) 



k=l P'eSfe:|/'|<|/| 
supC3<^„GC3aj(, 

and that the same inequality with sup replaced by inf in the P' summation holds as 
well. To see this consider two multitiles P^ = (I-^,a;^) S S^i and P^ = (/^,w^) G 3^2, 
pi / gQ ^Y^^^ |j2| < |ji| g^^j (j^^i PI (-^^^2 _^ 0_ rpj^g ^^g^ condition implies ki / ^2 

(since and S^^ are ^-over lapping) . The inequality (|27p is immediate if we can show 
that /I and are disjoint and if in addition |/^| < |/i|, then I2 does not belong to 
the top interval of the tree ■ Now, if = |/^|, then from (|16p it follows that 
ool = ool and hence, since P^ ^ P"^, we have n = 0. If |/^| < |Ji|, then by (fT5]) 
< since C^ul n Cso;^ / this implies that inf C2UJI > supC2a;f. As 

both trees are /-overlapping the top frequency of S^i belongs to C2^jjj and is above the 






top frequency of which belongs to C2Ujf. Thus by the maximahty condition on the 
top frequency in the selection process of the trees we see that the tree was selected 
before the tree 5^2, i-e. ki < K2- This implies that P2 does not belong to the tree T^^^, 
and since the interval ujs^^ is contained in the convex hull of ujf and we see that the 
time intervals and It = Is cannot intersect. Thus n = as C /r • This 
concludes the argument for (j27p . 

Now by the disjointness condition we see that indeed the left hand side of is 
bounded by a constant times 

PeSj i-2''<\lsj\ P€Sj:\I\=2' 

< 1/5,1 sup Yl i'^l^'^Ws^) 

One may check that, for each i, 

P€Sj:\I\=2'! 

and (j26p follows. We have already seen that (|26p implies (|24p ; this completes the proof 

of (HZD. 

Proof of il8\) . We need to show that for each dyadic interval J, we have 



I \ J J ^^'X' \ J J T^T 



dy 



dx < 2''£-\ 



\J\ 

This is an immediate consequence of 

(28) \It\<£^^AJ\. 

tgt 

Let 

T = {T G T : It C 2^+^ J, \It\ < \ J\} 
and note that if T G T with 2^/t n J / 0, J then T G T. Write f = f + f" where 
l/'l < lFn2^+5j and |/"| < lirnR\2«+5j- 



We will write T as the union of collections of trees T""*" U T° U U . . . each of 
which will have certain properties related to the energy. For each tree T G T there is an 
1-overlapping tree S = S{T) chosen in the algorithm above with Is = It and 

(29) j;K/,<^p)P>fV4. 
Let 

\Is\^^ 

For j > 1, define 

T^={rGT: sup J-Y\{f"Ap)\'>£'n(^} 
S'CS{T) l^S'l 
\Is,\=2-^It\ 
where, for each $T$, the sup above is taken over all $l$-overlapping trees $S'$ with $S' \subset S(T)$.
Finally, let 

rpmain = {T G T \ (T° U U . . .)}• 

We split the sum (j28p into the "main" term involving trees in T™^"^ and an error term 
involving Uj>oTj. 

We first consider the main term. It is our objective to prove 
(30) Yl \It\<£-'2'\J\. 

Let T G x™'^™ and let S' be any /-overlapping tree contained in S satisfying < 
Since the energy of P is bounded by £ and since T is not in any T-^, we have 

I A I pgg, I i I p^g, I i I p^g, 

From ([29]) and the fact that T ^ T°, we have 

E \{f',m'>£'/8-£'ne = £'n(^- 

This inequality allows us to essentially repeat the above proof of ()17p . The /-overlapping 
trees S{T) form a (finite or infinite) subsequence of the sequence Sj which we denote by 
Sji^i^) so that we have j(z^) > for > v' . We need to prove the analogue of (j22|) 

which is 

^"^^^ \p pi 2^+5 J| ^ I'^'^jX'') I ^ ^ 

and as before we are aiming to estimate the square of the expression on the left hand 
side by the expression itself. The Sj(^^^ satisfy 



and therefore 

^2 



^T^EI^-.J) < 256|Fn2^-V|-(E E K/'^-)l 

We continue to argue with exactly the same reasoning as in the proof of (|22|) . replacing 
/ with /' and F with F D 2^+^ J. This leads to the proof of ^ and thus to 

E \iT\<£-'\Fn2'+^J\ 

which is clearly < £~'^2^\J\. Thus (pOj) is established. 

For the complementary terms we prove better estimates, namely, for j = 0, 1, 2, . . . ,
(32) E \It\<2-'-^£-V\- 
For each T G and P e S{T) we have |/| < 2^^It\ < 2~-'| J|. Thus

TgTJ TgTJ PeS{T):\I\<2-3\J\ 

Since the S are pairwise disjoint, the right hand side is 

^2^'^"'E E K/^-^p)!'- 

k>j P:|/|=2-'=|J| 

Fixing k > j, we apply Minkowski's inequality to obtain

E \if"^^p)\'<( E ( E \{^Kf,<Pp)\'yy 

P:|/|=2-*|J| ^K:\K\=2^-''\J\ P: |/|=2-'^ | J| ^ 

/C2^+lj ft'n2«+2j=0 /C2'^+1J 

where above, we sum over dyadic intervals K and use the fact that /" is supported on 
M \ 2^+5 J. 

Now note that (ppi = cexp{2TTi{c{uj'^) — c{uju))-) (pp when / = /'. Thus if V{Io) is a 
collection of disjoint multitiles with common time interval /o, if 5 is supported on CIq 
and if Pj^ is any fixed multitile in V{Io) then Bessel's inequality gives 



E \i9,^p)\^<\Io\ J \9<pPjfdx. 



P&V{Io) 

We apply this observation to the inner sums in the previous display and obtain the 
inequality 

(33) E \if"^<pp)\'^^~'\J\( E ( E U^f"M\\l.fy 

P:|7|=2-'=|J| K:\K\=2'^-''\J\ r.\I\=2-''\J\ 

/C2*+lj Kn2^+^J=0 /C2^+ij 

where for each /, Pj is any multitile with time interval /. Since \f"\ < 1 the bound (jl4p 
yields 

Ih f"^ l|2 < h -U <iist{K,I) \-N 
W^Kl (PPi\\l2 ^ [i^^ \T\ — ) ■ 

Applying it with large N we see that the right hand side of is < 2~('^+^)(^~^)2~'^| J|. 
Summing over k > j we obtain inequality (I32p . This concludes the proof of (llSp . 

Proof of (fSOj) . For each T G T, let 5 = S{T) be the corresponding /-overlapping tree 
from the selection algorithm above and recall 

E I^t|(^:/2)^ < E \if^M\'- 

Since P = Ut'gt' right side above is dominated by 

(34) E E ia<^p)i'+E E i(/''^^)i' 

T'eT' PeT'nUTgT •S' T'eT' PeT'nUrgT •S' 

+ E E i(/''^^)i'- 

T'6T' PGT'nUTgT'S' 

sup C2(x!; <$y/ <inf Csuju 
For each T' G T' the set of tiles P £ T' r\y]^^r^ S{T) with the property that S^t' £ is
by definition an ^-overlapping tree. Thus, since P has energy bounded by £, we estimate 
the first term in (j34p 

E E ia^oP< E ^Vt'I. 

r'sT'PeT'nUTsT'S' t'gt' 

Cy/ GC2(^! 

For the estimation of the second term in (I34p we observe that any fixed multitile forms 
an ^-overlapping tree (with respect to some top data), and we use the energy bound for 
fixed tiles. We observe that the rectangles {/ x [inf Csw^, sup (720;^) : P G IJxeT ^} 
are pairwise disjoint. Indeed ii P & S, P' £ S this follows since S is /-overlapping 
(c/. Observation 14. ip . If P G S^^ and P' £ «^i < ^^2 then ,^5^ > ,^5^ and an 
overlap of [inf Csw^, sup C2ti;„) and [inf Csw^j, sup C2ti;(j) would imply that P' belongs to 
the maximal tree with the same top data as S^i, i.e. this would imply that P' € T^-^ 
which is disjoint from 8^2- 

This allows us to estimate for any fixed T' € T' 

E \{f,^p)\'< E s'\i\<s'\it'\. 

Per'nUrgT'S' Per'nUrgx^ 

S^j,/ >inf CsuJu ij^i >inf C-j,u)u 

Now sum over T' G T' and it follows that the middle term in (I34p is < ^jn,grj„ 
and summing 



For the estimation of the third term in (|34p we begin with a preliminary observation, 
also related to the selection of the 8. Suppose P£T'ri8,P£T'ri8 where T,T £ T 
and 

Ct' e [sup C2UJ1, inf Csuju) n [sup (720;;, inf Csa)^), 
and suppose I C I and P ^ P. From (I16p we have I I- We also have inf C2tD/ < 
sup C2i^i since otherwise it would follow that 8 was selected prior to 8 and hence P £ T 
which is impossible. From (jlSp . we have inf C2ti;; > supCaW; and so P is in the maximal 
/-overlapping tree contained in T' with top data (/, inf C2OJ1). 

For each T' £ T' let T" be the collection of multitiles P £ T' r\ (JreT ^ with ^t' G 
[sup (720;; , inf Cs^Ju) and / maximal among such multitiles. Then 

E E \u,<^p)\'< EE E K/'^-)i' 

T'£T' PeT'nUTex-^ T'GT' P"£T" PGr'nU^gT 

sup C2ti^i <5t' <inf C:iLJu sup 02^1^; <5t' <inf Csuiu 

Id" 

Considering the discussion in the preceding paragraph, we may apply the energy bound 
to the maximal /-overlapping tree contained in T' with top data (/, inf C2UI1) ■ Since the 
/" are disjoint sub intervals of It' we see that the right side in the last display is bounded 
by 

E E < E 2^^Vt'|. 

This completes the proof of (f20]l . □ 



The proposition below is for use in tandem with Proposition 4.3.

Proposition 4.4. Let $\mathbf{P}$ be a collection of multitiles and $\mu > 0$. Then, there is a collection of trees $\mathbf{T}$ such that

(35) $\sum_{T\in\mathbf{T}} |I_T| \lesssim \mu^{-r'}|E|$

and such that

$$\mathrm{density}\Big(\mathbf{P}\setminus\bigcup_{T\in\mathbf{T}} T\Big) \le \mu/2.$$
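Proof. We select trees through an iterative procedure. Suppose that trees $T_j$, $T_j^+$, $T_j^-$ have been chosen for $j = 1,\dots,k$. Let

$$\mathbf{P}_k = \mathbf{P}\setminus\bigcup_{j=1}^{k}\big(T_j \cup T_j^+ \cup T_j^-\big).$$

If $\mathrm{density}(\mathbf{P}_k) \le \mu/2$ then we terminate the procedure and set

$$\mathbf{T} = \{T_1, T_1^+, T_1^-, \dots, T_k, T_k^+, T_k^-\}.$$

Otherwise, we may find a nonempty tree $T \subset \mathbf{P}_k$ such that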

(36) ^ X (1 + ^T^)~' E i«'^(^)r' > 

Choose $T_{k+1} \subset \mathbf{P}_k$ so that $|I_{T_{k+1}}|$ is maximal among all nonempty trees contained in $\mathbf{P}_k$ which satisfy (36), and so that $T_{k+1}$ is the maximal, with respect to inclusion, tree contained in $\mathbf{P}_k$ with top data $(I_{T_{k+1}}, \xi_{T_{k+1}})$. Let $T_{k+1}^+ \subset \mathbf{P}_k$ be the maximal tree contained in $\mathbf{P}_k$ with top data $(I_{T_{k+1}}, \xi_{T_{k+1}} + (C_2-1)/(2|I_{T_{k+1}}|))$ and $T_{k+1}^- \subset \mathbf{P}_k$ be the maximal tree contained in $\mathbf{P}_k$ with top data $(I_{T_{k+1}}, \xi_{T_{k+1}} - (C_2-1)/(2|I_{T_{k+1}}|))$. Since each $T_j$ is nonempty and $\mathbf{P}$ is finite, this process will eventually stop.

To prove (I35p . it will suffice to verify 

(37) ^|/T,|<^"^'|i?|. 

j 

To this end, we first observe that the tiles It^ x ojt^ are pairwise disjoint. Indeed, suppose 
that [Itj X i-OTj) n {It I x '^t ,) 7^ and j < j' . Then, by the first maximality condition, 
we have \lTj\ ^ l-^Tj'l and so It., C Itj and \ujTj\ < I'^T^/I- From the latter inequality, 
it follows that for every P £ Tj/, either 0;^. C cOm, C UJm, or ujrp~ C Um- Thus, 

Tj' C Tj U Tj' U T^~ which contradicts the selection algorithm. 

Breaking the integral up into pieces and applying a pigeonhole argument, it follows 
from ()36p that for each j there is a positive integer ij such that 

(38) \It,\<C2-^'^^i-''' f \ak{x)\'' dx. 

For each £ we let = {Tj : ij = £} and choose elements of T^^); T^^\t^'^\... 
and subsets of T^^^: Tf\T2 \ ... as follows. Suppose T-^^ and T^^^ have been chosen 
for j = 1, . . . ,k. If T^^-* \ U^=i '^f^ is empty, then terminate the selection procedure.
Otherwise, let Tjf^^-^^ be an element of T^^) \ IJj=i with \I (i) \ maximal, and let 



k 

tI;1, = {Te tW \ U Tf : i2'lT x ujt) n (2^/^(.) x u;^,,, ) / 0}. 



By construction, T^^^ = (J • T^-^^ and so 



(39) |/t|<E E I^^I- 

Using the fact that the tiles Itj x are pairwise disjoint, and (twice) the fact that 

I-^tI < \ for every T G T^f\ we see that for each j 
j 



/J^ ^ rj-i ( ^ ) 



From ()38p . we thus see that the right side of (j39p is dominated by a constant times 

2- v^7 E ,0 E k(^)r'^^ 



i J fc:,tfc_i(x)Ga;^(£) 

3 



<2-2V-"'|ii;|. 



where we used the disjointness of the rectangles "^^IrpW x u; (o, and X^^^ |afc(x)|''' < 1. 

i 3 

Summing over we obtain (j37|) . □ 



5. The tree estimate 



In this section we prove the basic estimate for the model operators in the special case where the collection of multitiles is a tree. In what follows we use the notation $V^rAf(x)$ for the $r$-variation of $k \mapsto A_kf(x)$, for a given family of operators $A_k$ indexed by $k \in \mathbb{N}$.
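For a finite sequence the $r$-variation can be computed exactly, which is convenient for numerical experiments. The sketch below is an illustration and not part of the paper's argument: it uses a quadratic-time dynamic programme over the last chosen index, and the function name is hypothetical.

```python
def r_variation(seq, r):
    """r-variation of a finite real or complex sequence: the sup over
    increasing index tuples n_0 < ... < n_K of
    (sum_k |a_{n_k} - a_{n_{k-1}}|^r)^(1/r)."""
    # best[j]: largest sum of r-th powers over chains of indices ending at j
    best = [0.0] * len(seq)
    for j in range(len(seq)):
        for i in range(j):
            best[j] = max(best[j], best[i] + abs(seq[j] - seq[i]) ** r)
    return max(best, default=0.0) ** (1.0 / r)

v1 = r_variation([0, 1, 0, 1], 1)  # total variation of an oscillating sequence
v2 = r_variation([0, 1, 2], 2)     # for r = 2 the single jump 0 -> 2 beats two unit steps
```

For $r = 1$ this is the total variation, and for $r > 1$ the optimal chain may skip intermediate indices, as the second example shows. The values are nonincreasing in $r$, in line with the monotonicity of the variation norms.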

An essential tool introduced to harmonic analysis by Bourgain [1] is Lépingle's inequality for martingales ([22]). Consider the martingale of dyadic averages

$$\mathbb{E}_k[f](x) = \frac{1}{|I_k(x)|}\int_{I_k(x)} f(y)\,dy$$

where $I_k(x)$ is the dyadic interval of length $2^k$ containing $x$. It is a special case of Lépingle's inequality that

(40) $\|V^r\mathbb{E}[f]\|_{L^p} \le C_{p,r}\|f\|_{L^p}$

whenever $1 < p < \infty$ and $r > 2$. Simple proofs (based on jump inequalities) have been obtained in [1] and [27] (see also [9], [15] for other expositions). Inequality (40) has been extended to various families of convolution operators ([I],[I1], [2], [H]). Let $\psi$ be a Schwartz function on $\mathbb{R}$ with $\int\psi = 1$, for each $k$ let $\psi_k = 2^{-k}\psi(2^{-k}\cdot)$ and let $A_kf(x) = \psi_k * f(x)$. Then

(41) $\|V^rAf\|_{L^p} \le C_{p,r}\|f\|_{L^p}$, $1 < p < \infty$, $r > 2$,

follows from (40): the essential tool is the square function estimate

(42) $\Big\|\Big(\sum_{k=-\infty}^{\infty} |A_kf - \mathbb{E}_k[f]|^2\Big)^{1/2}\Big\|_{L^p} \le C_p\|f\|_{L^p}$, $1 < p < \infty$.

For a proof we refer to [13] or [15].

We shall use (41) to prove the following estimate in terms of energy and density. The bound will be applied in Section 7 with $q = r'$ and $q = 1$.

Proposition 5.1. Let $\mathcal{E} > 0$, $0 < \mu < 3$ and let $T$ be a tree with energy bounded above by $\mathcal{E}$ and density bounded above by $\mu$. Then, for each $1 \le q \le 2$,

(43) $\Big\|\sum_{P\in T} \langle f,\phi_P\rangle\,\phi_P\,a_P\,1_E\Big\|_{L^q} \lesssim \mathcal{E}\,\mu^{\min(1,r'/q)}\,|I_T|^{1/q}.$

Furthermore, for $\ell \ge 0$ we have

(44) $\Big\|\sum_{P\in T} \langle f,\phi_P\rangle\,\phi_P\,a_P\,1_E\Big\|_{L^q(\mathbb{R}\setminus 2^\ell I_T)} \lesssim 2^{-\ell(N-10)}\,\mathcal{E}\,\mu^{\min(1,r'/q)}\,|I_T|^{1/q}.$

We remark that the bounds above also hold for $2 < q < \infty$, but the result for this range of exponents is not needed for our purposes; it requires an additional estimate for $\sum_{P\in T}\langle f,\phi_P\rangle\phi_P$.

Proof. We begin with some preliminary reduction. Every tree can be split into an l- 
lacunary tree and an /-overlapping tree and it suffices to prove the asserted estimate for 
these cases. 

If the tree is /-overlapping we introduce further decompositions. By breaking up T 
into a bounded number of subtrees we may and shall assume without loss of generality 
that for each P £ T, 

(45) .^T G f^/ + for some integer j with \j\ < C2. 

Moreover we shall assume, for every /-overlapping tree T, that either < infw/ for 
every P G T (in which case we refer to T as /"-overlapping) or > inf for every 
P G T (in which case we refer to T as /"''-overlapping). Every /-overlapping tree can 
be split into an /"-overlapping and an /"^-overlapping tree. For the remainder of the 
proof, we assume without loss of generality that T is either /-lacunary or /"'"-overlapping 
or /"-overlapping, and that for the last two categories property (jl5|) is satisfied. 

Let J be the collection of dyadic intervals J which are maximal with respect to the 
property that I <^2>J for every P G T. 



Our first goal is to prove that for each J € J 



PeT:\I\<C"\J\ 
where C" > 1 is a constant to be determined later; we shall see that C" = 8Ci(C2 —
is an admissible choice. 

By Holder's inequality, we may assume that q > r'. Fix P & T with |/| < C"\J\. 
From the energy bound, we have 

(47) \\{f,cl>p)cl>paplE\\LHJ) ^ -^(1 + ^^^y^'hp'^EWLHJy 

From the density bound applied to ~ 1/(^2 — 1) nonempty trees, each with top time 
interval I, we obtain 

Since / ^ 3J, it follows that 1 + \x - y\/\I\ < C(l + dist(/, J)/\I\) for every x £ J and 
y £ I. Thus 

l|apli.|lL(.) < WaplEW^.^j^ < (l + ^)Vl/ 
where, above, we use the fact that \ap\ < 1. Thus we can replace (j47l) by 

||(/,^p)^papl^||,,(,)<f//^|/|Vni + ^)-(^-^\ 
Summing this estimate and using the fact that T is a tree, we have 

y: {fMct^papiE ^^^^^ <^://''(2-vi)^/ni+^nS^)"^''"'^ 



PeT:|/|=2-fc| J| 

and summing over k gives (jl6 



We now use ()46p to prove (I44p for £ > 4. Indeed, using the maximality of each J, we 
see that if £ > 4 and J n (M \ 2^/t) / then dist(/T, J) > \ J\/2 and | J| > 2^-3|/r|. It 
thus follows from (1461) that 



II Y^fAp)<t^paplE , ^ < (|/T|/|J|)(dist(/T, J)/|J|)"2^:;,--(iy/.)|J^|l/52-nA^-10) 

" PeT ^'^-^^ 
whenever J n (M \ 2^ It) ^ 0- Summing over all J, we thus obtain ()44p for £ > 4. 



It remains to prove 



PeT ^ ' 

which, by ()46p . follows from 

(48) V (/,0p)0papl£; ^ <£:/i--^'^'/9)|j^|i/9. 

PGT: |/|>C"|J| 

Again, by Holder's inequality, we may assume that q > r' . Let 

PGT:|7|>C"|J| 

The first step in the proof of (|48p will be to demonstrate 



(49) / Y W{xj' dx<ii^'\J\. 

Jjr\E , , , ^^r^ 



By the maximality of J there is a multifile P* = (/*, w*) G T with /* C 3 J where J is the 
dyadic double of J. This implies that there is a dyadic interval J' with \J\ < \J'\ < 4| J| 
and dist(J, J') < |J| and /* C J'. We wish to apply the density bound assumption to
a tree consisting of the one multitile P* , with top interval J' and suitable choice of the 
top frequency. We distinguish the cases that T is /"^"-overlapping, /"-overlapping, or 
/-lacunary. 

If T is /"^"-overlapping then T' = J') is a tree. For every P £ T we have 

uji C [^T-^,^T + ^) and thus, with \I\ > C"\J\, uji C [Ct - 2#pT> + 2#Pt)- Thus 
with the choice of C" > -^^^ we have Oj C ujt'- 

If T is /--overlapping then T' = {{P*},^t + J') 

is a tree. Using that T is / - 
overlapping, we see that uji C [S,t,^t + 2C'^ \ J\ ) every P & T with |/| > C"|J|. Thus, 
by choosing C" > we have fij C wt'- 

If T is /-lacunary then T' = {{P*},^t- J') is a tree. Using that T is /-lacunary, 
we see that uji C [£,t — Ic^^j^t) for every P £ T with |/| > C"\J\. Thus, by choosing 
C" > 8j=r^ as in the first case we have j C lo^'- 

In any of the three cases, the density bound gives 



\J' 

and hence (1^91). 



fc:Cfc_i(a;)ei^j,/ 



We now show that if $T$ is $l$-lacunary then (48) follows from (49). We start by observing
that for each $x$ there is at most one integer $m$ and at most one integer $k$ such that there
exists a $P \in T$ with $|I| = 2^m$, $\xi_{k-1}(x) \in \omega_l$, and $\xi_k(x) \in \omega_h$. Indeed, suppose such
a $P$ exists, and $P' \in T$ with $|I'| \ge |I|$. If $|I| = |I'|$ the uniqueness of $k$ is obvious.
Suppose \I'\ > Since T is /-lacunary, we have inf(ti;;) > sup(w/) by p^ . and so 
< inf(a;j'). We also have ^k{x) > mf{ujh) > sup(ci;t) > sup(a;;') since C2UJu^^h = 
and C20j'i. It follows that there is no k' with ^y_i{x) G uj[. 

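Now let $a(x) = a_k(x)$ if there exists an $m(x)$ as in the previous paragraph with
$2^{m(x)} \ge C''|J|$, and $a(x) = 0$ otherwise. We then have

$\Big\| \sum_{P\in T:\,|I|\ge C''|J|} \langle f,\phi_P\rangle \phi_P a_P 1_E \Big\|_{L^q(J)}^q \le \int_{J\cap E} \Big( |a(x)| \sum_{P\in T:\,|I|=2^{m(x)}} |\langle f,\phi_P\rangle \phi_P(x)| \Big)^q dx.$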

From the energy bound (applied to multitiles) and the bound (14) for $|\phi_P|$, the right
side is bounded by



\x~c{I)\\-N' 



1 

dx. 



[ [\a{x)\ S\I\m{x)y dx < [ [\a{x)\ £: (l + ^ 

Noting that $\sum_{P\in T:\,|I|=2^{m(x)}} (1 + |x - c(I)|/|I|)^{-N} \le C$, we see that the last display is



<S^ ! \a{xWdx<£'^ [ ( Yl 



dx, 
by our choice of $a(x)$. Using (49), $q > r'$, and the fact that $\sum_k |a_k(x)|^{r'} \le 1$, the right
hand side is $\lesssim \mathcal{E}^q \mu |J|$. We may now sum over those $J \in \mathcal{J}$ which satisfy $J \cap 16 I_T \ne \emptyset$
and (48) follows for $l$-lacunary trees.



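It remains to prove (48) for the case when $T$ is $l$-overlapping and satisfies condition
(15). For each $J \in \mathcal{J}$, and each $x \in J \cap E$ we have by Hölder's inequality

(50) $\Big| \sum_{P\in T:\,|I|\ge C''|J|} \langle f,\phi_P\rangle \phi_P(x) a_P(x) \Big| = \Big| \sum_k a_k(x) \sum_{\substack{P\in T:\,|I|\ge C''|J| \\ \xi_{k-1}(x)\in\omega_l,\ \xi_k(x)\in\omega_h}} \langle f,\phi_P\rangle \phi_P(x) \Big|$

$\le \Big( \sum_{k:\,\xi_{k-1}(x)\in\Omega_J} |a_k(x)|^{r'} \Big)^{1/r'} \Big( \sum_{k:\,\xi_{k-1}(x)\in\Omega_J} \Big| \sum_{\substack{P\in T:\,|I|\ge C''|J| \\ \xi_{k-1}(x)\in\omega_l,\ \xi_k(x)\in\omega_h}} \langle f,\phi_P\rangle \phi_P(x) \Big|^{r} \Big)^{1/r}.$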



Now let $\psi$ be a Schwartz function with $\hat\psi(\xi) = 1$ for $|\xi| \le C_1 + C_3$ and $\hat\psi(\xi) = 0$ for
$|\xi| \ge 2C_1$. Define $\psi_\ell = 2^{-\ell}\psi(2^{-\ell}\,\cdot)$, and set



erix) 



We will show that for any $x$ and any $k$ with $\xi_{k-1}(x) < \xi_k(x)$, there exist integers $\ell_1, \ell_2$
depending on $x$ and $k$, such that $2^{\ell_1} \ge |J|$ and



(51) 



E 

PeT:\I\>C"\J\, 
Cfc_i(a;)Gi^i,Cfe(a;)ei^h 



PeT 



|/| for some multifile P, 

bp)(j)p{x). 



From ()15p we have, for each ^ such that 2 

PeT PgT:|/|>2« 

Thus, to prove (51) it will suffice to show that there exist integers $\ell_1$ and $\ell_2$ such that

(52) $\{P \in T : |I| \ge C''|J|,\ \xi_{k-1}(x) \in \omega_l,\ \xi_k(x) \in \omega_h\} = \{P \in T : 2^{\ell_1} \le |I| \le 2^{\ell_2}\}.$

Again using (|15p . we see that for P,P'£T with |/| < |/'| we have info;^ < iniuh, and if 
we are in the setting of p-multitiles where p is a 1-index, we have the stronger inequality 
supcj^ < inf Wft. Thus, (|52p will follow after finding ii and £2 with 

(53) $\{P \in T : \xi_{k-1}(x) \in \omega_l\} = \{P \in T : 2^{\ell_1} \le |I| \le 2^{\ell_2}\}.$

Here we use assumption (45). The displayed equation follows when $|j| > 1$ from the fact
that $\omega_l \cap \omega_l' = \emptyset$ if $P, P' \in T$ and $|I| < |I'|$; it follows when $j = 0$ from the fact that the
intervals $\{\omega_l : P \in T\}$ are nested. Finally, when $j = \pm 1$ it follows from the property
that if $P, P', P'' \in T$, $|I| < |I'| < |I''|$ and $\omega_l \cap \omega_l'' \ne \emptyset$ then $\omega_l'' \subset \omega_l' \subset \omega_l$. Thus we have
established (53) and consequently (52) and (51).



Using (51), we have



E 



E 



(/, 4>p)4>p{x] 



r\ 1/r 



PeT:|7|>C"|J|. 
Cfc-i(a;)ew;,^fc{a;)Gi^(i 



< ll(eT^fc) * V(/,0p)0p(x)| 



pgt 



y,r(z++iog2(|j|)) 
where the notation refers to the variation norm with respect to the variable $k$, restricted
to $\{k \in \mathbb{Z} : k \ge \log_2(|J|)\}$. For $\log_2(|J|)$ < < we have



Per 



P€T 



and so, for $x \in J$



l(eT^fc) * ^{fAp)(t>p{^ 



P&T 



ly;{Z++log2{|J|)) 



< sup sup 



x&J R>\J\ \R\ Jx-R 



x+R 



l(eT^fe) * ^{f,cl)p)(l)p{y)\\yr 



P£T 



;'-(z++iog2(|j|)) 



dy. 



We now integrate the $q$th power of the expressions in (50) over $E \cap J$ and obtain

q 



{fAp)<PpaplE 

PGT:|/|>C"|J| 

■X+R 



< 

LHJ) JjnE 



^ \ak{t)\^'dtY'''x 

fc:5fe_i(i)GCj 



2 rx+JX 

sup sup — / Wieri'k) * (/> 'Ap)'^p(y)||v7-(2++iog^(|j,)) dy. 



< 



dx 



M[\\^k*{eT'Y.^f^<t>p)4>p)\\v^]{x) 
PeT 

where $\mathcal{M}$ is the Hardy-Littlewood maximal operator. For the last inequality we have
used $q > r'$, $\sum_k |a_k(x)|^{r'} \le 1$ and inequality (49). Summing over $J \in \mathcal{J}$ gives



{f,(l)p)(l)paplE 

P£T:\I\>C'"\J\ 



Li{ieiT) 



\v,: [X) 



PeT 



IL|(16/t)' 



Since $q < 2$, it follows from Hölder's inequality that the right side above is



pgt 



L2(16/t)- 



Applying the variation estimate (41) with $p = 2$ and the estimate for $\mathcal{M}$ one sees
that the display above is



< 



\It\ 



(2-q)/2\ 



PeT 



')P (pp 



1 

L2' 



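To finish the proof, it only remains to see that $\|\sum_{P\in T} \langle f,\phi_P\rangle \phi_P\|_{L^2}^2 \lesssim \mathcal{E}^2 |I_T|$. The
left side of this inequality is dominated by

$\sum_{P\in T} \sum_{P'\in T} |\langle f,\phi_P\rangle \langle f,\phi_{P'}\rangle \langle \phi_P,\phi_{P'}\rangle| \le 2 \sum_{P\in T} |\langle f,\phi_P\rangle|^2 \sum_{P'\in T} |\langle \phi_P,\phi_{P'}\rangle|.$

Since $T$ is an $l$-overlapping tree, we have $\langle \phi_P,\phi_{P'}\rangle = 0$ unless $|I| = |I'|$, in which case, we
have $|\langle \phi_P,\phi_{P'}\rangle| \lesssim (1 + \operatorname{dist}(I,I')/|I|)^{-N}$. Therefore we obtain the estimate

$\Big\| \sum_{P\in T} \langle f,\phi_P\rangle \phi_P \Big\|_{L^2}^2 \lesssim \sum_{P\in T} |\langle f,\phi_P\rangle|^2 \lesssim \mathcal{E}^2 |I_T|.$

This concludes the proof of (48) and thus the proof of the proposition. □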
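
The almost-orthogonality step can be written out in more detail. The following is only a sketch, under the assumptions (suggested by the text but not spelled out there) that each interval $I'$ carries at most one multitile of the tree, that $N > 1$, and that intervals of equal length are indexed as translates $I' = I + m|I|$ with $m \in \mathbb{Z}$; the constant $C_N$ is a name introduced here for illustration.

```latex
\begin{align*}
\Big\|\sum_{P\in T}\langle f,\phi_P\rangle\phi_P\Big\|_{L^2}^2
  &= \sum_{P,P'\in T}\langle f,\phi_P\rangle\,
     \overline{\langle f,\phi_{P'}\rangle}\,\langle\phi_P,\phi_{P'}\rangle \\
  &\le \sum_{P,P'\in T}\tfrac12\Big(|\langle f,\phi_P\rangle|^2
     +|\langle f,\phi_{P'}\rangle|^2\Big)\,|\langle\phi_P,\phi_{P'}\rangle| \\
  &= \sum_{P\in T}|\langle f,\phi_P\rangle|^2
     \sum_{P'\in T}|\langle\phi_P,\phi_{P'}\rangle| .
\end{align*}
% For fixed P only tiles with |I'| = |I| contribute, and for I' = I + m|I|
% one has dist(I,I') = max(|m|-1,0)|I|, hence
\[
\sum_{P'\in T}|\langle\phi_P,\phi_{P'}\rangle|
  \lesssim 1 + 2\sum_{m\ge 1} m^{-N} =: C_N < \infty
  \qquad (N>1).
\]
```

The second line uses $|ab| \le \tfrac12(|a|^2+|b|^2)$ and the third uses the symmetry of $|\langle\phi_P,\phi_{P'}\rangle|$ in $P$ and $P'$.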
6. Two auxiliary estimates



Before we give the argument on how to decompose our operators into trees with
suitable energy and density bounds we need two auxiliary estimates.

The following proposition can be found in ^3], p. 12, or as a special case of a lemma 
from 



Proposition 6.1. Let $T$ be an $l$-overlapping tree. Let $\lambda > 0$ and $\Omega_{\lambda,D} = \{\mathcal{M}_D[1_F] > \lambda\}$
where $\mathcal{M}_D$ is the maximal dyadic average operator. Then

' ' P(^T:I(tnx,D 

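The second auxiliary estimate is the special case of an estimate from [12], but we will
provide a proof for convenience.

Proposition 6.2. Let $\mathbf{P}$ be a finite set of multitiles, and let $\lambda > 0$, $F \subset \mathbb{R}$, and $|f| \le 1_F$.
Let $\Omega_\lambda = \{\mathcal{M}[1_F] > \lambda\}$. Then

(54) $\Big\| \sum_{P\in\mathbf{P}:\, I\subset\Omega_\lambda} \langle f,\phi_P\rangle \phi_P a_P \Big\|_{L^1(\mathbb{R}\setminus\Omega_\lambda)} \lesssim \lambda^{-1/r}|F|.$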



Proof. Fix $\ell$ and let $I_\ell \subset \Omega_\lambda$ be a dyadic interval satisfying
(55) $2^{\ell} I_\ell \subset \Omega_\lambda$ and $2^{\ell+1} I_\ell \not\subset \Omega_\lambda$.

By Minkowski's inequality, we estimate 

P:I=h 



f,^p)\') ■ 

P:I=h j=2 P:I=h 

Using orthogonality, the right hand side is bounded by 

oo 

(4|/,|)l/2||l4,J0pJ|^.+^(2^-+V£l)'/'l|l2.+i/A2^7,/</'PollL2 

i=2 

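where $P_0$ is any multitile with $I_0 = I_\ell$. Applying the bounds (14) and $|f| \le 1_F$, we see
that the last display is

$\lesssim |F \cap 4I_\ell|^{1/2} + \sum_{j=2}^{\infty} C 2^{-j(N-1)} |F \cap 2^{j+1} I_\ell|^{1/2}.$

Since $2^{\ell+1} I_\ell \not\subset \Omega_\lambda$, we have $|F \cap 2^{j+1} I_\ell| \le C 2^{\max(\ell,j)} |I_\ell| \lambda$ for each $j$. Thus, the last
display is $\lesssim (2^{\ell}\lambda|I_\ell|)^{1/2}$ and we have proved

$\Big( \sum_{P:\,I=I_\ell} |\langle f,\phi_P\rangle|^2 \Big)^{1/2} \lesssim (2^{\ell}\lambda|I_\ell|)^{1/2}.$

Similarly,

$\sup_{P:\,I=I_\ell} |\langle f,\phi_P\rangle| \lesssim 2^{\ell}\lambda|I_\ell|^{1/2}$

and so, by interpolation,

(56) $\Big( \sum_{P:\,I=I_\ell} |\langle f,\phi_P\rangle|^r \Big)^{1/r} \lesssim (2^{\ell}\lambda)^{1/r'} |I_\ell|^{1/2}$

whenever $2 \le r \le \infty$. For each $\xi$ and $I_\ell$ there is at most one $P \in \mathbf{P}$ with $\xi \in \omega_l$ and $I = I_\ell$.
Thus, using the fact that, for each $x$, $\sum_k |a_k(x)|^{r'} \le 1$, we see that

$\Big\| \sum_{P\in\mathbf{P}:\,I=I_\ell} \langle f,\phi_P\rangle \phi_P a_P \Big\|_{L^1(\mathbb{R}\setminus\Omega_\lambda)} \lesssim (2^{\ell}\lambda)^{1/r'} |I_\ell|^{1/2} \|\phi_{P_0}\|_{L^1(\mathbb{R}\setminus\Omega_\lambda)}$

where $P_0$ is any multitile with $I_0 = I_\ell$. Using the fact that $2^{\ell} I_\ell \subset \Omega_\lambda$, it follows that the
right side above is

$\lesssim 2^{-\ell(N-2)} \lambda^{1/r'} |I_\ell|.$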
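
The exponents in (56) can be checked by a short computation. This is a sketch of the interpolation between the $\ell^2$ and $\ell^\infty$ bounds, writing $a_P = |\langle f,\phi_P\rangle|$ (a shorthand introduced here) and suppressing constants.

```latex
\begin{align*}
\|a\|_{\ell^r}
  &\le \|a\|_{\ell^2}^{2/r}\,\|a\|_{\ell^\infty}^{1-2/r}
     \qquad (2\le r\le\infty) \\
  &\lesssim \big(2^{\ell}\lambda|I_\ell|\big)^{\frac12\cdot\frac2r}
     \big(2^{\ell}\lambda|I_\ell|^{1/2}\big)^{1-\frac2r} \\
  &= (2^{\ell}\lambda)^{\frac1r+1-\frac2r}\,
     |I_\ell|^{\frac1r+\frac12-\frac1r}
   = (2^{\ell}\lambda)^{1/r'}\,|I_\ell|^{1/2}.
\end{align*}
```

The first inequality follows from $\sum_P a_P^r \le \|a\|_{\ell^\infty}^{r-2}\sum_P a_P^2$.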

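For $\ell \ge 0$ let $\mathcal{I}_\ell$ be the set of all dyadic intervals satisfying (55). If $I \in \mathcal{I}_\ell$ then for each
$j \ge 0$ there are at most 2 intervals $I' \in \mathcal{I}_\ell$ with $I' \subset I$ and $|I'| = 2^{-j}|I|$. By considering
the collection of maximal dyadic intervals in $\mathcal{I}_\ell$, one sees that

$\sum_{I\in\mathcal{I}_\ell} |I| \lesssim |\Omega_\lambda|.$

Thus,

$\Big\| \sum_{P\in\mathbf{P}:\,I\in\mathcal{I}_\ell} \langle f,\phi_P\rangle \phi_P a_P \Big\|_{L^1(\mathbb{R}\setminus\Omega_\lambda)} \lesssim 2^{-\ell(N-2)} \lambda^{1/r'} |\Omega_\lambda|.$

Summing over $\ell$ and applying the weak-type 1-1 estimate for $\mathcal{M}$ then gives (54). □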
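
The last step of the proof can be made explicit. The following sketch assumes $N > 2$, so that the geometric series converges, and writes $C_{\mathrm{wk}}$ (a name introduced here) for the weak-type 1-1 constant of $\mathcal{M}$.

```latex
\begin{align*}
\sum_{\ell\ge 0} 2^{-\ell(N-2)}\lambda^{1/r'}|\Omega_\lambda|
  &\lesssim \lambda^{1/r'}|\Omega_\lambda| , \\
|\Omega_\lambda| = \big|\{\mathcal{M}[1_F]>\lambda\}\big|
  &\le \frac{C_{\mathrm{wk}}}{\lambda}\,\|1_F\|_{L^1}
   = \frac{C_{\mathrm{wk}}\,|F|}{\lambda}, \\
\text{hence}\qquad
\lambda^{1/r'}|\Omega_\lambda|
  &\lesssim \lambda^{\frac1{r'}-1}|F| = \lambda^{-1/r}|F| .
\end{align*}
```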



7. Conclusion of the proof 

Let $r > 2$ and $r' < p < \infty$. We shall conclude the proof of (13), with $1 \le |E| \le 2$,
and thus of Theorem 1.2. It will then suffice, by Chebyshev's inequality, to show

(57) $\int_{E\setminus G} |S_{\mathbf{P}}[f](x)|\,dx \le C|F|^{1/p}$

for any measurable set $E$ with $1 \le |E| \le 2$, $|f| \le 1_F$ and some exceptional set $G =
G(E,F)$ with $|G| \le 1/4$.

We shall repeatedly apply Propositions 4.3 and 4.4. By Lemma 4.2 the density of $\mathbf{P}$
(with respect to the set $E$ above) and the energy (with respect to $f$) are bounded by a
universal constant $C$.

We distinguish between the cases $|F| > 1$ and $|F| \le 1$ and first consider the case when
$|F| > 1$. Repeatedly applying Propositions 4.3 and 4.4 we write $\mathbf{P}$ as the disjoint union

$\mathbf{P} = \bigcup_{j\ge 0} \bigcup_{T\in\mathcal{T}_j} T$

where each Tj is a collection of trees T each of which have energy bounded by C2-J/2 1 F| 1/2^ 
density bounded by C2~^/^ , and satisfy 
For each $j$ we apply Proposition 4.3 again, this time using (18) and (20) to write

$\bigcup_{T\in\mathcal{T}_j} T = \bigcup_{k\ge 0} \bigcup_{T\in\mathcal{T}_{j,k}} T$

where each tree T € Tj ^ has energy bounded by C'2~(-'+'^)/^|F|^/^, density bounded by 
C2~^/^\ and satisfies 

(58) \It\<'^'- 



< 22^2-''+'' I Fl"^ 



Moreover, for every £ > 
(59) II ^^'iT 

Inequality (58) implies $\|\sum_{T\in\mathcal{T}_{j,k}} 1_{2^{\ell} I_T}\|_{L^1} \lesssim 2^{\ell+j+k}$, and we may interpolate the $L^1$ and the
BMO bound. Here we use a standard technique involving the sharp maximal function
from §5 in [11]. It follows that for $1 \le q < \infty$



Let $\epsilon > 0$ be small and $C'' > 0$ be large, depending on $p, q, r$. For each $j, k, \ell$ define

By Chebyshev's inequality, we have

so setting $G = \bigcup_{j,k,\ell\ge 0} G_{j,k,\ell}$ we have $|G| \le 1/4$.
Applying Minkowski's inequality gives



Is E (f'p)<PP"'p\ 
PeP 



+ EII^^ E l2^M2^-l/T E^^''^^)'^^«^ILl(K\G,,.,)7• 
From Hölder's inequality, Fubini's theorem, and the definition of $G_{j,k,\ell}$, it follows that
the right side above is $\lesssim (S_1 + S_2)$ where



5i= E E \\^EY.(f^cPpHpap\C,^ 

j,k>0 TGTj.fe PeT 



l/r' 



and 

5,= 5^|i.|-V(.V)2(i+.)0>.+2.)A.( E l|li^E(/'^^)^^«HIL'^'(M\2-.z. ''''' 
j,k>o tgTj fe PeT 
Applying Proposition 5.1 with the energy and density bounds for trees $T \in \mathcal{T}_{j,k}$, we
see that

52 < ^ |_p|-l/{9''^)2(l+^)(j+^+2^)A'-2-^(^-10)2-(j+fe)/2|_p|l/22-j/r'J^ ^ |/t|^^^''' 



i>i 

Choosing $\epsilon$ small enough and $q$ large enough so that

2 111 

(1 + e)- - 1 < and < - 

r 2 g'r p 

we have $S_2 \lesssim |F|^{1/p}$. We similarly obtain $S_1 \lesssim |F|^{1/p}$, thus giving (57).

We will finish by proving (57) for $|F| \le 1$. Here, we let

$G = \{x : \mathcal{M}[1_F](x) > C''|F|\}$

where $\mathcal{M}$ is the Hardy-Littlewood maximal operator and $C''$ is chosen large enough so
that the weak-type 1-1 estimate for $\mathcal{M}$ guarantees $|G| \le 1/4$.
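
The choice of $C''$ amounts to a one-line computation. In the sketch below, $C_{\mathrm{wk}}$ denotes the weak-type 1-1 constant of $\mathcal{M}$; this name is introduced here for illustration only.

```latex
\[
|G| = \big|\{\mathcal{M}[1_F] > C''|F|\}\big|
  \le \frac{C_{\mathrm{wk}}}{C''|F|}\,\|1_F\|_{L^1}
  = \frac{C_{\mathrm{wk}}}{C''}
  \le \frac14
  \qquad\text{as soon as } C'' \ge 4\,C_{\mathrm{wk}} .
\]
```

Note that the bound on $|G|$ does not depend on $|F|$, which is why the threshold is taken proportional to $|F|$.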

By Proposition 6.2 and the fact that $p > r'$, it will remain to show that

(60) ||l^E(/''^^)'^^«^|LwK\n^'^''^'' 

PeP' ^ ^ ' 

where $\mathbf{P}' = \{P \in \mathbf{P} : I \not\subset G\}$.

Finally, it follows from Proposition 6.1 that the energy of $\mathbf{P}'$ is bounded above by
$C|F|$. Repeatedly applying Propositions 4.3 and 4.4 we write $\mathbf{P}'$ as the disjoint union

$\mathbf{P}' = \bigcup_{j\ge 0} \bigcup_{T\in\mathcal{T}_j} T$

where each T,- is a collection of trees T each of which have energy bounded hy Co2-^/^\F\'^/^, 
density bounded by Co'2r^l'^ , and satisfy 

TST, 

We then have 

Is E '\f^^p)^P^P ^1 - E E l^-f^ '^{f,<Pp)(ppap 
PeP' i>o TeTj PeT 

Applying Proposition 5.1 we see that the right side above is

<E E min(2-^V2|^|i/2^|^|)2-.A-'|j^| < J]2^-Amin(2-^V2|^|i/2^|^|)_ 

i>0 TgT, j>0 

Summing over $j$, we see that the right side above is $\lesssim |F|^{1/r'}$. This finishes the proof,
since $p > r'$.
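
The summation over $j$ can be sketched as follows. This is only an illustration under an assumption: that the $j$-th term of the preceding display is comparable to $2^{j/r}\min(2^{-j/2}|F|^{1/2},|F|)$. The index $j_0$, defined by $2^{j_0} \approx |F|^{-1}$, is introduced here for the computation.

```latex
\begin{align*}
\sum_{j\le j_0} 2^{j/r}\,|F|
  &\lesssim 2^{j_0/r}|F| \approx |F|^{1-\frac1r}, \\
\sum_{j> j_0} 2^{j(\frac1r-\frac12)}\,|F|^{1/2}
  &\lesssim 2^{j_0(\frac1r-\frac12)}|F|^{1/2}
   \approx |F|^{\frac12-\frac1r}\,|F|^{\frac12} = |F|^{1-\frac1r}
   \qquad (r>2), \\
\text{so that}\qquad
\sum_{j\ge 0} 2^{j/r}\min\!\big(2^{-j/2}|F|^{1/2},|F|\big)
  &\lesssim |F|^{1/r'} \le |F|^{1/p}
   \qquad (|F|\le 1,\ p>r').
\end{align*}
```

The minimum equals $|F|$ exactly when $2^{-j} \ge |F|$, which is the reason for splitting the sum at $j_0$; the second geometric series converges because $r > 2$.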






A. Transference 



In this section we show how to obtain Theorem 1.1 from Theorem 1.2. We employ
arguments from chapter VII in the monograph by Stein and Weiss [32] in their proof of
De Leeuw's transference result ([U]). The following limiting relation is used:

Lemma A.1. ([32], p. 261). Let $m \in L^\infty(\mathbb{R}^d)$ with the property that every $k \in \mathbb{Z}^d$ is a
Lebesgue point of $m$. Define a convolution operator $T$ on $L^2(\mathbb{R}^d)$ by the Fourier transform
identity $\widehat{Tf}(\xi) = m(\xi)\hat f(\xi)$ and a convolution operator $\widetilde T$ on $L^2(\mathbb{T}^d)$ by the relation $[\widetilde T f]^{\wedge}_k =
m(k)\hat f_k$ for the Fourier coefficients. Let, for $x \in \mathbb{R}^d$,

vj{x) = e~^^'^ and vjr{x) = R~'^w{x/R). 

Then, for all trigonometric polynomials P and Q (extended as 1-periodic functions in 
every variable) we have 

(61) / f[P]{x)Q{x)dx=lim / T[PwR/ ,]{x)Q{x)wjij {x)dx. 



We also need the following elementary fact on Lorentz spaces. 

Lemma A.2. Let $f \in L^{p,q}(\mathbb{T}^d)$ and extend $f$ to a function $f^{\mathrm{per}}$ on $\mathbb{R}^d$ which is 1-periodic
in every variable. Let $L > d/p$ and let $w$ be a measurable function satisfying
$|w(x)| \le (1 + |x|)^{-L}$. Let $w_R(x) = R^{-d/p} w(R^{-1}x)$. Then

$\sup_{R\ge 1} \|f^{\mathrm{per}} w_R\|_{L^{p,q}(\mathbb{R}^d)} \le C_{p,q} \|f\|_{L^{p,q}(\mathbb{T}^d)}.$



Proof. We first assume $p = q$. Let $Q_0 := [-\frac12, \frac12]^d$. If $N \in \mathbb{N}$ and $N \le R < N + 1$ then
$|w_R(x)| \lesssim N^{-d/p}(1 + |n|/N)^{-L}$ for $x \in n + Q_0$ and by the periodicity we can estimate
$\|f^{\mathrm{per}} w_R\|_p^p$ by $C \sum_{n\in\mathbb{Z}^d} N^{-d}(1 + |n|/N)^{-Lp} \|f\|^p_{L^p(Q_0)}$ which is $\lesssim \|f\|^p_{L^p(\mathbb{T}^d)}$ since $Lp > d$.

For fixed $L$ we apply real interpolation in the range $p > d/L$ and obtain the Lorentz
space result. □
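
The uniform bound on the lattice sum in the case $p = q$ is a Riemann-sum comparison. The following sketch assumes only $Lp > d$ and $N \ge 1$; the constants $C_{d,L,p}$ are names introduced here and may change from line to line.

```latex
\begin{align*}
\sum_{n\in\mathbb{Z}^d} N^{-d}\Big(1+\frac{|n|}{N}\Big)^{-Lp}
  &\le C_{d,L,p}\int_{\mathbb{R}^d}\Big(1+\frac{|y|}{N}\Big)^{-Lp}\,\frac{dy}{N^{d}}
   \;=\; C_{d,L,p}\int_{\mathbb{R}^d}(1+|z|)^{-Lp}\,dz \\
  &= C_{d,L,p}\,\int_0^\infty (1+s)^{-Lp}\,s^{d-1}\,ds \;<\;\infty
   \qquad\Longleftrightarrow\qquad Lp>d .
\end{align*}
```

The first inequality uses that $(1+|y|/N)^{-Lp}$ varies by at most a bounded factor over each unit cube $n + Q_0$ when $N \ge 1$, and the substitution $y = Nz$ removes the dependence on $N$.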



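Proof that Theorem 1.2 implies Theorem 1.1. We shall assume that $1 < p < \infty$, $1 \le
q_1 < \infty$, and $q_1 \le q_2 \le \infty$ and prove that the $L^{p,q_1}(\mathbb{R}) \to L^{p,q_2}(\mathbb{R}; V^r)$ boundedness for the partial
sum operator $\mathcal{S}$ on the real line implies the corresponding result on the torus, i.e.

(62) $\Big\| \sup_K \sup_{0\le n_1<\dots<n_K} \Big( \sum_{i=1}^{K-1} |S_{n_{i+1}}f - S_{n_i}f|^r \Big)^{1/r} \Big\|_{L^{p,q_2}(\mathbb{T})} \le C \|f\|_{L^{p,q_1}(\mathbb{T})}.$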



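By two applications of the monotone convergence theorem it suffices to show for fixed
$K \in \mathbb{N}$ with $K \ge 2$, and fixed $M \in \mathbb{N}$ that

(63) $\Big\| \sup_{0\le n_1<\dots<n_K\le M} \Big( \sum_{i=1}^{K-1} |S_{n_{i+1}}f - S_{n_i}f|^r \Big)^{1/r} \Big\|_{L^{p,q_2}(\mathbb{T})} \le C \|f\|_{L^{p,q_1}(\mathbb{T})}$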



where C does not depend on M and K. 



For $n = (n_1, \dots, n_K) \in \mathbb{N}_0^K$, $1 \le i \le K-1$ define $Tf(x,n,i) = S_{n_{i+1}}f(x) - S_{n_i}f(x)$ if
$n_1 < \dots < n_K$ and $Tf(x,n,i) = 0$ otherwise. Then the inequality (63) just says that $T$
is bounded from $L^{p,q_1}$ to $L^{p,q_2}(\ell^\infty(\ell^r))$ where the $\ell^\infty$ norm is taken for functions on the
finite set $\{1, \dots, M\}^K$ and the $\ell^r$ norm is for functions on $\{1, \dots, K-1\}$. By duality
(63) follows from the $L^{p',q_2'}(\ell^1(\ell^{r'})) \to L^{p',q_1'}$ inequality for the adjoint operator $T^*$, i.e.
from the inequality



1 A'-l 
''^ 0<ni<---<n/f <M i=l 



K-l 



^ ^ ^ f ^ ^ \fn;i\ 



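We fix an irrational number $\lambda$ in $(0, 1)$, say $\lambda = 1/\sqrt{2}$. We then define "partial sum
operators" for Fourier integrals by $\widehat{\mathfrak{S}_t f}(\xi) = \chi_{[-\lambda,\lambda]}(\xi/t)\hat f(\xi)$ and a corresponding partial
sum operator $\widetilde{\mathfrak{S}}_t$ on Fourier series by letting the $k$th Fourier coefficient of $\widetilde{\mathfrak{S}}_t f$ be equal
to $\chi_{[-\lambda,\lambda]}(k/t)\hat f_k$. We define a function $\nu : \mathbb{N}_0 \to \mathbb{N}_0$ as follows: set $\nu(0) = 0$ and for
$n > 0$ let $\nu(n)$ be the smallest positive integer $\nu$ for which $\lambda\nu > n$. Notice that then

(65) $S_{n_{i+1}}f_{n,i}(x) - S_{n_i}f_{n,i}(x) = \widetilde{\mathfrak{S}}_{\nu(n_{i+1})}f_{n,i}(x) - \widetilde{\mathfrak{S}}_{\nu(n_i)}f_{n,i}(x).$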
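
The identity (65) rests on the fact that, for $\lambda = 1/\sqrt{2}$, the integers $k$ with $|k| \le \lambda\nu(n)$ are exactly those with $|k| \le n$. The following Python sketch checks this numerically with exact integer arithmetic; the function names `nu` and `largest_frequency` are hypothetical helpers introduced for this illustration and do not appear in the paper.

```python
from math import isqrt


def nu(n: int) -> int:
    """Smallest positive integer v with v / sqrt(2) > n, and nu(0) = 0.

    v / sqrt(2) > n  <=>  v*v > 2*n*n.  Since 2*n*n is never a perfect
    square for n > 0, the smallest such v is isqrt(2*n*n) + 1.
    """
    if n == 0:
        return 0
    return isqrt(2 * n * n) + 1


def largest_frequency(v: int) -> int:
    """Largest integer k >= 0 with k <= v / sqrt(2), i.e. 2*k*k <= v*v."""
    return isqrt(v * v // 2)


if __name__ == "__main__":
    for n in range(6):
        print(n, nu(n), largest_frequency(nu(n)))
```

Because $\lambda$ is irrational, $\lambda\nu(n)$ is never an integer for $n > 0$, so the cutoff $\chi_{[-\lambda,\lambda]}(k/\nu(n))$ selects precisely the frequencies $|k| \le n$ of the classical partial sum $S_n$.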



Now in order to prove (64) it clearly suffices to verify it for the case that the functions
$f_{n,i}$ and $Q$ are trigonometric polynomials. The multipliers corresponding to $\mathfrak{S}_t$ are
continuous at every integer. Thus by (61) (applied with $d = 1$) and (65) we see that (64)
is implied by



(66) 



K-l „i 



E J2 [ ['^u{ni+i){fn,iWR,p')-&^(^ni){fn,iWR,p')]QwRipdx 

0<ni<---<nK<M i=l ^ 



K-l 



n 1=1 



for sufficiently large R. 



Now notice that $\mathfrak{S}_t = \mathcal{S}_{\lambda t} - \mathcal{S}_{-\lambda t}$ so that the assumed $L^{p,q_1}(\mathbb{R}) \to L^{p,q_2}(V^r, \mathbb{R})$ boundedness
for the family $\{\mathcal{S}_t\}$ implies the analogous statement for the family $\{\mathfrak{S}_t\}$. We
run the duality argument in the reverse direction (now for functions defined on $\mathbb{R}$) and
deduce



» K~l 

L E e(s [fn,^ 

•^^0<ni<-<nK<M i=l 



Wr/p'] - Gi,{n,)\fn,iWR/p>\)QwR/pdx 



K-l 



< 



E Ei/n^> 



Wr/p' 



n i=l 



By Lemma A.2 the right hand side of this inequality is for $R > \max\{p, p'\}$ bounded by
the right hand side of (66). Thus we have established inequality (66) and this concludes
the proof. □
For $\xi, x \in \mathbb{R}$ let

$\mathcal{C}[f](\xi, x) = \int_{-\infty}^{x} e^{-2\pi i \xi x'} f(x')\,dx'.$

Menshov, Paley, and Zygmund extended the Hausdorff-Young inequality by proving a
version of the bound

(67) $\|\mathcal{C}[f]\|_{L^{p'}_\xi(L^\infty_x)} \le C_p \|f\|_{L^p(\mathbb{R})}$

for $1 \le p < 2$. The bound at $p = 2$ is a special case of the much more difficult maximal
inequality for the partial sum operator proved by Carleson and Hunt. Interpolating
Theorem 1.2 at $p = 2$ with a trivial estimate at $p = 1$, one obtains the following
strengthened version of (67)

(68) $\|\mathcal{C}[f]\|_{L^{p'}_\xi(V^r_x)} \le C_{p,r} \|f\|_{L^p(\mathbb{R})}$

for $1 \le p \le 2$ and $r > p$. It follows from the same arguments given in Section 2 that this
range of $r$ is the best possible. Our interest in this variational bound primarily stems from
the fact, which will be proven in Appendix C, that it may be transferred, when $r < 2$, to
give a corresponding estimate for certain nonlinear Fourier summation operators. The
purpose of the present appendix is to give an easier alternate proof of (68) when $p < 2$.
Note that Pisier and Xu [27] have proved closely related U' — t- L'^{V^) inequalities for 
orthonormal systems of (not necessarily bounded) functions on an arbitrary measure 
space. 

A now-famous lemma of Christ and Kiselev [5] asserts that if an integral operator

$Tf(x) = \int_{\mathbb{R}} K(x,y) f(y)\,dy$

is bounded from $L^p(\mathbb{R})$ to $L^q(X)$ for some measure space $X$ and some $q > p$, thus

$\|Tf\|_{L^q(X)} \le A \|f\|_{L^p(\mathbb{R})},$

then automatically the maximal function

$T_* f(x) = \sup_{N\in\mathbb{R}} \Big| \int_{y<N} K(x,y) f(y)\,dy \Big|$

is also bounded from $L^p(\mathbb{R})$ to $L^q(X)$, with a slightly larger constant. Another way to
phrase this is as follows. If we define the partial integrals

$T_{<} f(x,N) = \int_{y<N} K(x,y) f(y)\,dy$

then we have

(69) $\|T_{<} f\|_{L^q_x(L^\infty_N)} \le C_{p,q} A \|f\|_{L^p(\mathbb{R})}.$

As was observed by Christ and Kiselev, this may be applied in conjunction with the
Hausdorff-Young inequality to obtain (67) for $p < 2$.

The $L^\infty_N$ norm can also be interpreted as the $V^\infty_N$ norm, and we will now see that $V^\infty_N$
can be replaced by $V^r_N$ for $r > p$, thus giving (68) from the Hausdorff-Young inequality.
Lemma B.1. Under the same assumptions, we have

$\|T_{<} f\|_{L^q_x(V^r_N)} \le C_{p,q,r} A \|f\|_{L^p(\mathbb{R})}$

for any $r > p$.

Proof. This follows by an adaptation of the argument by Christ and Kiselev, or by the
following argument. Without loss of generality we may take $r < q$, in particular $r < \infty$.
We use a bootstrap argument. Let us make the a priori assumption that

(70) $\|T_{<} f\|_{L^q_x(V^r_N)} \le BA \|f\|_{L^p(\mathbb{R})}$

for some constant $0 < B < \infty$; this can be accomplished for instance by truncating the
kernel $K$ appropriately. We will show that this a priori bound automatically implies the
bound

(71) $\|T_{<} f\|_{L^q_x(V^r_N)} \le (2^{1/r - 1/p} BA + C_{p,q,r} A) \|f\|_{L^p(\mathbb{R})}$

for some $C_{p,q,r} > 0$. This implies that the best bound $B$ in the above inequality will
necessarily obey the inequality

$B \le 2^{1/r - 1/p} B + C_{p,q,r};$

since $r > p$, this implies $B \le C'_{p,q,r}$ for some finite $C'_{p,q,r}$, and the claim follows.
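
The bootstrap inequality can be solved explicitly. In the sketch below, $\theta$ is shorthand introduced here for the contraction factor $2^{1/r - 1/p}$; the computation uses only $r > p$ and the finiteness of $B$ provided by the a priori assumption.

```latex
\[
\theta := 2^{\frac1r-\frac1p} < 1 \quad (r>p), \qquad
B \le \theta B + C_{p,q,r}
\;\Longrightarrow\;
(1-\theta)\,B \le C_{p,q,r}
\;\Longrightarrow\;
B \le \frac{C_{p,q,r}}{1-2^{\frac1r-\frac1p}} .
\]
```

The resulting bound does not depend on the truncation used to make $B$ finite, so it survives when the truncation is removed. It degenerates as $r \downarrow p$, consistent with the requirement $r > p$.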

It remains to deduce (71) from (70). Fix $f$; we may normalize $\|f\|_{L^p(\mathbb{R})} = 1$. We find
a partition point $N_0$ in the real line which halves the $L^p$ norm of $f$:

$\int_{-\infty}^{N_0} |f(y)|^p\,dy = \int_{N_0}^{+\infty} |f(y)|^p\,dy = \frac12.$

Write $f_-(y) = f(y)1_{(-\infty,N_0]}(y)$ and $f_+(y) = f(y)1_{[N_0,+\infty)}(y)$, thus $\|f_-\|_{L^p(\mathbb{R})} = 2^{-1/p}$
and $\|f_+\|_{L^p(\mathbb{R})} = 2^{-1/p}$. We observe that

$T_{<} f(x,N) = T_{<} f_-(x,N)$ when $N \le N_0$,
$T_{<} f(x,N) = Tf_-(x) + T_{<} f_+(x,N)$ when $N > N_0$.

Furthermore, $T_{<} f_-(x,\cdot)$ and $T_{<} f_+(x,\cdot)$ are bounded in $L^\infty$ norm by $O(T_* f(x))$. Thus
we have

$\|T_{<} f(x,\cdot)\|_{V^r} \le (\|T_{<} f_-(x,\cdot)\|^r_{V^r} + \|T_{<} f_+(x,\cdot)\|^r_{V^r})^{1/r} + O(T_* f(x)).$

(The $O(T_* f(x))$ error comes because the partition used to define $\|T_{<} f(x,\cdot)\|_{V^r}$ may have
one interval which straddles $N_0$). We take $L^q$ norms of both sides to obtain

$\|T_{<} f\|_{L^q_x V^r_N} \le \|(\|T_{<} f_-(x,\cdot)\|^r_{V^r} + \|T_{<} f_+(x,\cdot)\|^r_{V^r})^{1/r}\|_{L^q_x} + O(\|T_* f\|_{L^q}).$

The error term is at most $C_{p,q} A$ by the ordinary Christ-Kiselev lemma. For the main
term, we take advantage of the fact that $r < q$ to interchange the $\ell^r$ and $L^q$ norms, thus
obtaining

$\|T_{<} f\|_{L^q_x V^r_N} \le (\|T_{<} f_-\|^r_{L^q_x V^r_N} + \|T_{<} f_+\|^r_{L^q_x V^r_N})^{1/r} + O(C_{p,q} A).$

By inductive hypothesis we thus have

$\|T_{<} f\|_{L^q_x V^r_N} \le ((2^{-1/p} BA)^r + (2^{-1/p} BA)^r)^{1/r} + O(C_{p,q} A),$

and the claim follows. □
C. Variation norms on Lie groups



In this appendix, we will show that certain $r$-variation norms for curves on Lie groups
can be controlled by the corresponding variation norms of their "traces" on the Lie
algebra as long as $r < 2$. This follows from work of Terry Lyons [23]; we present a
self-contained proof in this appendix. Combining this fact with the variational Menshov-
Paley-Zygmund theorem of Appendix B, we rederive the Christ-Kiselev theorem on the
pointwise convergence of the nonlinear Fourier summation operator for $L^p(\mathbb{R})$ functions,
$1 \le p < 2$.

Let $G$ be a connected finite-dimensional Lie group with Lie algebra $\mathfrak{g}$. We give $\mathfrak{g}$ any
norm $\|\cdot\|_{\mathfrak{g}}$, and push forward this norm using left multiplication by the Lie group to
define a norm $\|x\|_{T_gG} = \|g^{-1}x\|_{\mathfrak{g}}$ on each tangent space $T_gG$ of the group. Observe that
this norm structure is preserved under left group multiplication.

We can now define the length $|\gamma|$ of a continuously differentiable path $\gamma : [a,b] \to G$
by the usual formula

$|\gamma| = \int_a^b \|\gamma'(t)\|_{T_{\gamma(t)}G}\,dt.$

Observe that this notion of length is invariant under left group multiplication, and also
under reparameterization of the path $\gamma$.

From this notion of length, we can define a metric $d(g,g')$ on $G$ as

$d(g,g') = \inf_{\gamma:\,\gamma(a)=g,\ \gamma(b)=g'} |\gamma|$

where $\gamma$ ranges over all differentiable paths from $g$ to $g'$. It is easy to see that this does
indeed give a metric on $G$.

Integral curves of left invariant vector fields need not be geodesic for this metric [28],
but the length of a short segment of such an integral curve is within a quadratically small
error of the distance between the two endpoints. This is the content of the following
lemma:

Lemma C.1. If $x \in \mathfrak{g}$ is such that $\|x\|_{\mathfrak{g}} \le \epsilon$ for some sufficiently small $\epsilon$, then

$d(1, \exp(x)) = \|x\|_{\mathfrak{g}} + O(\|x\|_{\mathfrak{g}}^2).$

Proof. By considering the exponential curve $\gamma : [0,1] \to G$ defined by $\gamma(t) := \exp(tx)$
we obtain the upper bound $d(1, \exp(x)) \le \|x\|_{\mathfrak{g}}$. Now consider any competitor curve
$\gamma : [0,1] \to G$ from $1$ to $\exp(x)$ which has shorter length than $\|x\|_{\mathfrak{g}}$. We write $\gamma(t) =
\exp(f(t))$ for some smooth curve $f : [0,1] \to \mathfrak{g}$ from $0$ to $x$; this is well-defined if $\epsilon$ is
small enough.

There are two cases. First suppose that $f$ stays inside the ball $\{y : \|y\|_{\mathfrak{g}} \le 2\|x\|_{\mathfrak{g}}\}$.
Then from Taylor expansion we see that

$\|\gamma'(t)\|_{T_{\gamma(t)}G} = (1 + O(\|x\|_{\mathfrak{g}})) \|f'(t)\|_{\mathfrak{g}}$

and hence

$|\gamma| = (1 + O(\|x\|_{\mathfrak{g}})) \int_0^1 \|f'(t)\|_{\mathfrak{g}}\,dt.$

But from Minkowski's inequality one has $\int_0^1 \|f'(t)\|_{\mathfrak{g}}\,dt \ge \|x\|_{\mathfrak{g}}$, and the claim follows.

Now suppose instead that $f$ leaves this ball. Let $0 < t_0 < 1$ be the first time at which
this occurs. Then the above argument gives

$|\gamma| \ge (1 + O(\|x\|_{\mathfrak{g}})) \int_0^{t_0} \|f'(t)\|_{\mathfrak{g}}\,dt.$

By Minkowski's inequality one has $\int_0^{t_0} \|f'(t)\|_{\mathfrak{g}}\,dt \ge 2\|x\|_{\mathfrak{g}}$, and this gives a contradiction
to $|\gamma| \le \|x\|_{\mathfrak{g}} \le \epsilon$ if $\epsilon$ is sufficiently small. The claim follows. □



Given any continuous path $\gamma : [a,b] \to G$ and $1 \le r < \infty$, we define the $r$-variation
$\|\gamma\|_{V^r}$ of $\gamma$ to be the quantity

$\|\gamma\|_{V^r} = \sup_{a=t_0<t_1<\dots<t_n=b} \Big( \sum_{j=0}^{n-1} d(\gamma(t_{j+1}), \gamma(t_j))^r \Big)^{1/r}$

where the supremum ranges over all partitions of $[a,b]$ by finitely many times $a = t_0, t_1, \dots,
t_n = b$. We can extend this to the $r = \infty$ case in the usual manner as

$\|\gamma\|_{V^\infty} = \sup_{a=t_0<t_1<\dots<t_n=b}\ \sup_{0\le j\le n-1} d(\gamma(t_{j+1}), \gamma(t_j)),$

and indeed it is clear that the $V^\infty$ norm of $\gamma$ is simply the diameter of the range of $\gamma$.
The $V^1$ norm of $\gamma$ is finite precisely when $\gamma$ is rectifiable, and when $\gamma$ is differentiable
it corresponds exactly with the length $|\gamma|$ of $\gamma$ defined earlier. It is easy to see the
monotonicity property

$\|\gamma\|_{V^p} \le \|\gamma\|_{V^r}$ whenever $1 \le r \le p \le \infty$

and the triangle inequalities

$(\|\gamma_1\|^r_{V^r} + \|\gamma_2\|^r_{V^r})^{1/r} \le \|\gamma_1 + \gamma_2\|_{V^r} \le \|\gamma_1\|_{V^r} + \|\gamma_2\|_{V^r}$

where $\gamma_1 + \gamma_2$ is the concatenation of $\gamma_1$ and $\gamma_2$. A key fact about the $V^r$ norms is that
they can be subdivided:

Lemma C.2. Let $\gamma : [a,b] \to G$ be a continuously differentiable curve with finite $V^r$
norm. Then there exists a decomposition $\gamma = \gamma_1 + \gamma_2$ of the curve into two sub-curves
such that

$\|\gamma_1\|_{V^r},\ \|\gamma_2\|_{V^r} \le 2^{-1/r} \|\gamma\|_{V^r}.$

Proof. Let $t_* = \sup\{t \in [a,b] : \|\gamma|_{[a,t]}\|_{V^r} \le 2^{-1/r}\|\gamma\|_{V^r}\}$. Letting $\gamma_1 = \gamma|_{[a,t_*]}$ we have
$\|\gamma_1\|_{V^r} = 2^{-1/r}\|\gamma\|_{V^r}$. The bound for $\gamma_2 = \gamma|_{[t_*,b]}$ follows from the left triangle inequality
above. □
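
The $r$-variation and the subdivision of Lemma C.2 can be illustrated on a finite real sequence. The Python sketch below is a discrete analogue only: it replaces the curve in $G$ by a list of real numbers with the usual distance, and the names `r_variation` and `split_index` are hypothetical helpers introduced for this illustration. In the discrete setting the exact equality in the proof of Lemma C.2 is not available, so the split drops the single jump that crosses the threshold.

```python
def r_variation(seq, r):
    """r-variation of a finite real sequence.

    Supremum over index chains n_0 < ... < n_K of
    (sum_i |seq[n_i] - seq[n_{i-1}]|**r) ** (1/r),
    computed by an O(n^2) dynamic program.
    """
    n = len(seq)
    if n < 2:
        return 0.0
    # best[j] = largest sum of r-th powers of jumps over chains ending at j
    best = [0.0] * n
    for j in range(1, n):
        best[j] = max(best[i] + abs(seq[j] - seq[i]) ** r for i in range(j))
    return max(best) ** (1.0 / r)


def split_index(seq, r):
    """First index t with r_variation(seq[:t+1]) >= 2**(-1/r) * total.

    Then seq[:t] has variation below the threshold, and seq[t:] has
    variation at most the threshold by the left triangle inequality.
    """
    if not seq:
        return 0
    threshold = 2.0 ** (-1.0 / r) * r_variation(seq, r)
    for t in range(len(seq)):
        if r_variation(seq[: t + 1], r) >= threshold:
            return t
    return len(seq) - 1


if __name__ == "__main__":
    path = [0.0, 1.0, 0.0, 1.0]
    t = split_index(path, 2)
    print(r_variation(path, 2), t, r_variation(path[:t], 2), r_variation(path[t:], 2))
```

The dynamic program reflects the fact that an optimal chain ending at index $j$ extends an optimal chain ending at some earlier index $i$, which is the same superadditivity that underlies the left triangle inequality.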
Given a continuously differentiable curve $\gamma : [a,b] \to G$, we can define its left trace
$\gamma_l : [a,b] \to \mathfrak{g}$ by the formula

$\gamma_l(t) = \int_a^t \gamma(s)^{-1} \gamma'(s)\,ds.$

Note that the trace is also a continuously differentiable curve, but taking values now in
the Lie algebra $\mathfrak{g}$ instead of $G$. Clearly $\gamma_l$ is determined uniquely from $\gamma$. The converse
is also true after specifying the initial point $\gamma(a)$ of $\gamma$, since $\gamma$ can then be recovered by
solving the ordinary differential equation

(72) $\gamma'(t) = \gamma(t)\gamma_l'(t).$

This equation is fundamental in the theory of eigenfunctions of a one-dimensional Schrödinger
or Dirac operator, or equivalently in the study of the nonlinear Fourier transform;
see, for example, [33], [25] for a full discussion. Basically for a fixed potential $f(t)$ and
a frequency $k$, the nonlinear Fourier transform traces out a curve $\gamma(t)$ (depending on $k$)
taking values in a Lie group (e.g. $SU(1,1)$), and the corresponding left trace is essentially
the ordinary linear Fourier transform.

It is easy to see that these curves have the same length (i.e. they have the same $V^1$
norm):

(73) $|\gamma| = |\gamma_l|.$

We now show that something similar is true for the $V^r$ norms provided that $r < 2$.

Lemma C.3. Let $1 \le r < 2$, let $G$ be a connected finite-dimensional Lie group, and let
$\|\cdot\|_{\mathfrak{g}}$ be a norm on the Lie algebra of $G$. Then there exists a constant $C > 0$ depending
only on these above quantities, such that for all smooth curves $\gamma : [a,b] \to G$, we have

(74) $\|\gamma\|_{V^r} \le \|\gamma_l\|_{V^r} + C\min(\|\gamma_l\|^2_{V^r}, \|\gamma_l\|^r_{V^r})$

and

(75) $\|\gamma_l\|_{V^r} \le \|\gamma\|_{V^r} + C\min(\|\gamma\|^2_{V^r}, \|\gamma\|^r_{V^r}).$

An analogous result holds for the right trace, $\int_a^t \gamma'(s)\gamma(s)^{-1}\,ds$, once the left-invariant
norm on $T_gG$ is replaced by a right-invariant norm.

Proof. We may take $r > 1$ since the claim is already known for $r = 1$ thanks to (73).
It shall suffice to prove the existence of a small $\delta > 0$ such that we have the estimate

(76) $\|\gamma\|_{V^r} = \|\gamma_l\|_{V^r} + O(\|\gamma_l\|^2_{V^r})$

whenever $\|\gamma_l\|_{V^r} \le \delta$, and similarly

(77) $\|\gamma_l\|_{V^r} = \|\gamma\|_{V^r} + O(\|\gamma\|^2_{V^r})$

whenever $\|\gamma\|_{V^r} \le \delta$. (We allow the $O()$ constants here to depend on $r$, the Lie group
$G$, and the norm structure, but not on $\delta$). Let us now see why these estimates will prove
the lemma. Let us begin by showing that (76) implies (74). Certainly this will be the
case if $\gamma_l$ has $V^r$ norm less than $\delta$. If instead $\gamma_l$ has $V^r$ norm larger than $\delta$, we can use
Lemma C.2 repeatedly to partition it into $O(\delta^{-r}\|\gamma_l\|^r_{V^r})$ curves, all of whose $V^r$ norms
are less than $\delta$. These curves are the left-traces of various components of $\gamma$, and thus
by (76) these components have a $V^r$ norm bounded by some quantity depending on $\delta$.
Concatenating these components together (using the triangle inequality) we obtain the
result. A similar argument allows one to deduce (75) from (77).

Next, we observe that to prove the two estimates (76), (77) it suffices to just prove
one of the two, for instance (76), as this will also imply (77) for $\|\gamma\|_{V^r}$ sufficiently small
by the usual continuity argument (look at the set of times $t$ for which the restriction of $\gamma$
to $[a,t]$ obeys a suitable version of (77), and use (76) to show that this set is both open
and closed if $\|\gamma\|_{V^r}$ is small enough).

It remains to prove (76) for $\delta$ sufficiently small. We shall in fact prove the more precise
statement (note $\gamma_l(a) = 0$)

(78) $\|\log(\gamma(a)^{-1}\gamma(b)) - \gamma_l(b)\|_{\mathfrak{g}} \le K\|\gamma_l\|^2_{V^r}$

for some absolute constant $K > 0$ (and for $\delta$ sufficiently small), where $\log$ is the inverse
of the exponential map $\exp : \mathfrak{g} \to G$. Note that it follows from a continuity argument
as in the previous paragraph that if $\delta$ is sufficiently small then $\gamma(b)^{-1}\gamma(a)$ is sufficiently
close to the identity so that the logarithm is well-defined. Let us now see why (78)
implies (76). Applying the inequality to any segment $[t_j, t_{j+1}]$ in $[a,b]$ we see that

$\|\log(\gamma(t_j)^{-1}\gamma(t_{j+1})) - (\gamma_l(t_{j+1}) - \gamma_l(t_j))\|_{\mathfrak{g}} \le K\|\gamma_l|_{[t_j,t_{j+1}]}\|^2_{V^r}$

and hence with Lemma C.1 (since $\delta$ is small)

$d(\gamma(t_{j+1}), \gamma(t_j)) = \|\gamma_l(t_{j+1}) - \gamma_l(t_j)\|_{\mathfrak{g}} + O(\|\gamma_l|_{[t_j,t_{j+1}]}\|^2_{V^r}).$

Estimating $O(\|\gamma_l|_{[t_j,t_{j+1}]}\|^2_{V^r})$ crudely by $\|\gamma_l\|_{V^r} O(\|\gamma_l|_{[t_j,t_{j+1}]}\|_{V^r})$ and taking the $\ell^r$ sum
in the $j$ index, we see that for any partition $a = t_0 < \dots < t_n = b$ we have

$\Big( \sum_{j=0}^{n-1} d(\gamma(t_{j+1}), \gamma(t_j))^r \Big)^{1/r} = \Big( \sum_{j=0}^{n-1} \|\gamma_l(t_{j+1}) - \gamma_l(t_j)\|^r_{\mathfrak{g}} \Big)^{1/r} + O(\|\gamma_l\|^2_{V^r}).$

Taking suprema over all partitions we obtain the result.

It remains to prove (78) for some suitably large $K$. This we shall do by an induction
on scale (or "Bellman function") argument. Let us fix the smooth curve $\gamma$. We shall
prove the estimate for all subcurves of $\gamma$, i.e. for all intervals $[t_1,t_2]$ in $[a,b]$, we shall
prove that

(79) $\|\log(\gamma(t_1)^{-1}\gamma(t_2)) - (\gamma_l(t_2) - \gamma_l(t_1))\|_{\mathfrak{g}} \le K\|\gamma_l|_{[t_1,t_2]}\|^2_{V^r}.$

Let us first prove this in the case when the interval $[t_1,t_2]$ is sufficiently short, say of
length at most $\epsilon$ for some very small $\epsilon$ (depending on $\gamma$). In that case, we perform a
Taylor expansion to obtain

(80) $\gamma_l(t) = \gamma_l(t_1) + \gamma_l'(t_1)(t - t_1) + \frac12\gamma_l''(t_1)(t - t_1)^2 + O_\gamma((t - t_1)^3)$

and

(81) $\gamma_l'(t) = \gamma_l'(t_1) + \gamma_l''(t_1)(t - t_1) + O_\gamma((t - t_1)^2)$

when $t \in [t_1,t_2]$, and where the $\gamma$ subscript in $O_\gamma$ means that the constants here are
allowed to depend on $\gamma$ (more specifically, on the norm of $\gamma$), and the $O()$ is with
respect to the $\|\cdot\|_{\mathfrak{g}}$ norm. Also we remark that as $\gamma$ is assumed smooth, $\gamma_l'(t_1)$ is bounded
away from zero. It is then an easy matter to conclude that

(82) $\|\gamma_l|_{[t_1,t_2]}\|_{V^r} \sim \|\gamma_l'(t_1)\|_{\mathfrak{g}}\,|t_2 - t_1|$
if $\epsilon$ is sufficiently small depending on $\gamma$. On the other hand, from (72) and (81) we have

$\gamma'(t) = \gamma(t)\big(\gamma_l'(t_1) + \gamma_l''(t_1)(t - t_1) + O_\gamma((t - t_1)^2)\big)$

from which one may conclude that

$\gamma(t) = \gamma(t_1)\exp\big(\gamma_l'(t_1)(t - t_1) + \tfrac12\gamma_l''(t_1)(t - t_1)^2 + O(\|\gamma_l'(t_1)\|^2_{\mathfrak{g}}|t - t_1|^2) + O_\gamma((t - t_1)^3)\big)$

for all $t \in [t_1,t_2]$, if $\epsilon$ is sufficiently small. We rewrite this as

$\log(\gamma(t_1)^{-1}\gamma(t)) = \gamma_l'(t_1)(t - t_1) + \tfrac12\gamma_l''(t_1)(t - t_1)^2 + O(\|\gamma_l'(t_1)\|^2_{\mathfrak{g}}|t - t_1|^2) + O_\gamma((t - t_1)^3)$

and then specialize to the case $t = t_2$. By (80), we have

$\log(\gamma(t_1)^{-1}\gamma(t_2)) - (\gamma_l(t_2) - \gamma_l(t_1)) = O(\|\gamma_l'(t_1)\|^2_{\mathfrak{g}}|t_2 - t_1|^2) + O_\gamma((t_2 - t_1)^3),$

and hence by (82) we have (79) if $t_2 - t_1$ is small enough (depending on $\gamma$) and $K$ is
large enough (independent of $\gamma$).



This proves (79) when the interval $[t_1,t_2]$ is small enough. By (82), it also proves (79)
when $\|\gamma_l|_{[t_1,t_2]}\|_{V^r}$ is sufficiently small. To conclude the proof of (79) in general, we now
assert the following inductive claim: if (79) holds whenever $\|\gamma_l|_{[t_1,t_2]}\|_{V^r} < \epsilon$ for some
given $0 < \epsilon < \delta$, then it also holds whenever $\|\gamma_l|_{[t_1,t_2]}\|_{V^r} < 2^{1/r}\epsilon$, providing that $K$
is sufficiently large (independent of $\epsilon$) and $\delta$ is sufficiently small (depending on $K$, but
independent of $\epsilon$). Iterating this we will obtain the claim (79) for all intervals $[t_1,t_2]$ in
$[a,b]$.

It remains to prove the inductive claim. Let $[t_1,t_2]$ be any subinterval of $[a,b]$ such
that the quantity $A = \|\gamma_l|_{[t_1,t_2]}\|_{V^r}$ is less than $2^{1/r}\epsilon$. Applying Lemma C.2 we may
subdivide $[t_1,t_2] = [t_1,t_*] \cup [t_*,t_2]$ such that

$\|\gamma_l|_{[t_1,t_*]}\|_{V^r},\ \|\gamma_l|_{[t_*,t_2]}\|_{V^r} \le 2^{-1/r}A < \epsilon < \delta.$

By the inductive hypothesis, we thus have

$\|\log(\gamma(t_1)^{-1}\gamma(t_*)) - (\gamma_l(t_*) - \gamma_l(t_1))\|_{\mathfrak{g}} \le K2^{-2/r}A^2$

and

$\|\log(\gamma(t_*)^{-1}\gamma(t_2)) - (\gamma_l(t_2) - \gamma_l(t_*))\|_{\mathfrak{g}} \le K2^{-2/r}A^2.$

In particular, we have

$\|\log(\gamma(t_1)^{-1}\gamma(t_*))\|_{\mathfrak{g}} \le \|\gamma_l(t_*) - \gamma_l(t_1)\|_{\mathfrak{g}} + K2^{-2/r}A^2$
$\le \|\gamma_l|_{[t_1,t_*]}\|_{V^r} + O(KA^2)$
$= O(A(1 + KA)) = O(A(1 + K\delta)) = O(A)$

if $\delta$ is sufficiently small depending on $K$. Similarly we have

$\|\log(\gamma(t_*)^{-1}\gamma(t_2))\|_{\mathfrak{g}} = O(A)$

and hence by the Baker-Campbell-Hausdorff formula (if $\delta$ is sufficiently small)

$\|\log(\gamma(t_1)^{-1}\gamma(t_2)) - \log(\gamma(t_1)^{-1}\gamma(t_*)) - \log(\gamma(t_*)^{-1}\gamma(t_2))\|_{\mathfrak{g}} = O(A^2).$

By the triangle inequality, we thus have

$\|\log(\gamma(t_1)^{-1}\gamma(t_2)) - (\gamma_l(t_2) - \gamma_l(t_1))\|_{\mathfrak{g}} \le 2K2^{-2/r}A^2 + O(A^2).$

We now use the hypothesis $r < 2$, which forces $2 \times 2^{-2/r} < 1$. If $K$ is large enough
(depending on $r$, but independently of $\delta$, $A$, or $\epsilon$) we thus have (79). This closes the
inductive argument. □
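
The role of the hypothesis $r < 2$ in closing the induction can be made explicit. In the sketch below, $C_0$ is a name introduced here for the implied constant in the $O(A^2)$ term, which comes from the Baker-Campbell-Hausdorff step and does not depend on $K$ once $\delta$ is small enough.

```latex
\[
2K\,2^{-2/r}A^2 + C_0A^2 \;\le\; KA^2
\quad\Longleftrightarrow\quad
K\big(1-2^{\,1-\frac2r}\big) \;\ge\; C_0
\quad\Longleftrightarrow\quad
K \;\ge\; \frac{C_0}{1-2^{\,1-\frac2r}} ,
\]
\[
\text{where } 2^{\,1-\frac2r}<1 \iff 1-\tfrac2r<0 \iff r<2 .
\]
```

At $r = 2$ the factor $2 \cdot 2^{-2/r}$ equals $1$ and no choice of $K$ absorbs the error term, which matches the failure of the lemma at that exponent.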



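Letting $w, v$ be any elements of the Lie algebra $\mathfrak g$, one can define a nonlinear Fourier
summation operator associated to $G$, $w$, $v$ by means of the left trace
\[ \mathcal{NC}[f](k,0) = I, \]
\[ \frac{\partial}{\partial x}\mathcal{NC}[f](k,x) = \mathcal{NC}[f](k,x)\,\bigl(\mathrm{Re}(e^{-2\pi i k x}f(x))\,w + \mathrm{Im}(e^{-2\pi i k x}f(x))\,v\bigr), \]
or (giving a different operator) by the right trace
\[ \mathcal{NC}[f](k,0) = I, \]
\[ \frac{\partial}{\partial x}\mathcal{NC}[f](k,x) = \bigl(\mathrm{Re}(e^{-2\pi i k x}f(x))\,w + \mathrm{Im}(e^{-2\pi i k x}f(x))\,v\bigr)\,\mathcal{NC}[f](k,x). \]
Above, $k, x \in \mathbb{R}$, $\mathcal{NC}[f]$ takes values in $G$, $I$ is the identity element of $G$, and Re, Im are
the real and imaginary parts of a complex number. An example of interest is given by
$G = SU(1,1)$, and
\[ w = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad v = \begin{pmatrix} 0 & i \\ -i & 0 \end{pmatrix}. \]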

Combining Lemma C.3 with the variational Menshov-Paley-Zygmund theorem of the
previous section, we obtain a variational version of the Christ-Kiselev theorem [6]. 
Namely, we see that for 1 < p < 2 and r > p 

and 

||l|A^C[/]|>l-^C[/]||yj,/ < Gp^r,G,w,v\\f\\LPiR)- 

Note that the usual logarithms are hidden in the d metric we have placed on the Lie 
group G. 

Extending these estimates to the case $p = 2$ is an interesting and challenging problem,
even when $r = \infty$, which would correspond to a nonlinear Carleson theorem. Lemma
C.3 cannot be extended to any exponent $r \ge 2$. Sandy Davie and the fifth author of
this paper have an unpublished example of a curve in the Lie group $SU(1,1)$ with trace
in the subspace of $\mathfrak{su}(1,1)$ of matrices vanishing on the diagonal so that the diameter of
the curve is not controlled by the 2-variation of the trace.



Terry Lyons' machinery [2^ via iterated integrals faces an obstruction in a potential 
application to a nonlinear Carleson theorem because of the unboundedness results for 
the iterated integrals shown in [26].



D. An application to ergodic theory 



Wiener-Wintner type theorems form the area of ergodic theory that is most closely related
to the study of Carleson's operator. In [19], Lacey and Terwilleger prove the following
singular integral variant of the Wiener-Wintner theorem:
Theorem D.1. For $1 < p$, all measure preserving flows $\{T_t : t \in \mathbb{R}\}$ on a probability
space $(X,\mu)$ and functions $f \in L^p(\mu)$, there is a set $X_f \subset X$ of probability one, so that
for all $x \in X_f$ we have that the limit
\[ \lim_{s \to 0} \int_{s < |t| < 1/s} e^{i\theta t} f(T_t x)\,\frac{dt}{t} \]
exists for all $\theta \in \mathbb{R}$.



One idea to approach such convergence results is to study quantitative estimates in
the parameter $s$ that imply convergence, as pioneered by Bourgain's paper [1] in a similar
context. We first need to pass to a mollified variant of the above theorem:

Theorem D.2. Let $\phi$ be a function on $\mathbb{R}$ in the Wiener space, i.e. the Fourier transform
$\hat\phi$ is in $L^1(\mathbb{R})$. For $1 < p$, all measure preserving flows $\{T_t : t \in \mathbb{R}\}$ on a probability space
$(X,\mu)$ and functions $f \in L^p(\mu)$, there is a set $X_f \subset X$ of probability one, so that for all
$x \in X_f$ we have that the limits
\[ \lim_{s \to \infty} \int e^{i\theta t} f(T_t x)\,\phi(st)\,\frac{dt}{t}, \qquad \lim_{s \to 0} \int e^{i\theta t} f(T_t x)\,\phi(st)\,\frac{dt}{t} \]
exist for all $\theta \in \mathbb{R}$.



This theorem clearly follows from an a priori estimate
\[ \Bigl\| \sup_{\theta} \Bigl\| \int e^{i\theta t} f(T_t x)\,\phi(st)\,\frac{dt}{t} \Bigr\|_{V^r(s)} \Bigr\|_{L^p(x)} \le C \|f\|_{L^p} \]
for $r > \max(2,p')$. Here we have written $V^r(s)$ for the variation norm taken in the
parameter $s$ of the expression inside, and likewise for $L^p(x)$. The variation norm is the
strongest norm widely used in this context, while Lacey and Terwilleger use a weaker
oscillation norm in the proof of their theorem.
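The passage from the a priori estimate to convergence rests on an elementary counting fact, sketched below. The symbols $F$, $\varepsilon$ and $N_\varepsilon$ are notation introduced for this illustration only: $F(s)$ stands for the integral inside the $V^r(s)$ norm, with $x$ and $\theta$ fixed.

```latex
% F(s): the integral inside the V^r(s) norm for fixed x and \theta (hypothetical shorthand).
% N_\varepsilon: number of jumps of size at least \varepsilon along s_0 < s_1 < \dots < s_K.
\[
  N_\varepsilon
  := \#\bigl\{\, 1 \le k \le K : |F(s_k) - F(s_{k-1})| \ge \varepsilon \,\bigr\}
  \le \varepsilon^{-r} \sum_{k=1}^{K} |F(s_k) - F(s_{k-1})|^{r}
  \le \varepsilon^{-r}\, \|F\|_{V^r(s)}^{r}.
\]
% A finite variation norm therefore allows only finitely many \varepsilon-jumps
% for every \varepsilon > 0, so F(s) is Cauchy both as s \to 0 and as s \to \infty.
```

Because the supremum over $\theta$ sits inside the $L^p(x)$ norm, the set of $x$ where the variation norm is finite for all $\theta$ simultaneously has full measure, which is what gives an exceptional set independent of $\theta$.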



By a standard transfer method ([3], [7]) involving replacing $f$ by translates $T_y f$ and
an averaging procedure in $y$, the a priori estimate can be deduced from an analogous
estimate on the real line
\[ \Bigl\| \sup_{\xi} \Bigl\| \int e^{i\xi t} f(x+t)\,\phi(st)\,\frac{dt}{t} \Bigr\|_{V^r(s)} \Bigr\|_{L^p(x)} \le C \|f\|_{L^p}. \tag{83} \]



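The main purpose of this appendix is to show how this estimate (83) can be deduced
from the main theorem of this paper by an averaging argument. We write the $V^r(s)$
norm explicitly and expand $\phi$ into a Fourier integral to obtain for the left hand side of
(83) the expression
\[ \Bigl\| \sup_{\xi} \sup_{K,\ s_0 < \dots < s_K} \Bigl( \sum_{k=1}^{K} \Bigl| \iint e^{i\xi t} f(x+t) \bigl( e^{i\eta s_k t} - e^{i\eta s_{k-1} t} \bigr) \hat\phi(\eta)\,d\eta\,\frac{dt}{t} \Bigr|^{r} \Bigr)^{1/r} \Bigr\|_{L^p(x)}. \]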



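Now pulling the integral in $\eta$ out of the various norms and considering only positive $\eta$
(with the case of negative $\eta$ being similar) and defining $\xi_k = \xi + \eta s_k$ we obtain the upper
bound
\[ \int_0^\infty |\hat\phi(\eta)|\, \Bigl\| \sup_{K,\ \xi_0 < \dots < \xi_K} \Bigl( \sum_{k=1}^{K} \Bigl| \int \bigl( e^{i\xi_k t} - e^{i\xi_{k-1} t} \bigr) f(x+t)\,\frac{dt}{t} \Bigr|^{r} \Bigr)^{1/r} \Bigr\|_{L^p(x)} d\eta. \]
Now applying the variational Carleson estimate and doing the trivial integral in $\eta$ bounds
this term by a constant times
\[ \|\hat\phi\|_{L^1(\mathbb{R})}\,\|f\|_{L^p}. \]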
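The change of variables behind this reduction can be written out in one line. The sketch below assumes the Fourier inversion formula is normalized so that $\phi(st) = \int \hat\phi(\eta) e^{i\eta s t}\,d\eta$; this normalization is an assumption made here to match the definition of $\xi_k$, and other conventions change only constants.

```latex
% Assumed normalization: \phi(st) = \int \hat\phi(\eta)\, e^{i\eta s t}\, d\eta.
% With \xi_k = \xi + \eta s_k, the modulation combines with the increment in s:
\[
  e^{i\xi t}\bigl( e^{i\eta s_k t} - e^{i\eta s_{k-1} t} \bigr)
  = e^{i(\xi + \eta s_k) t} - e^{i(\xi + \eta s_{k-1}) t}
  = e^{i\xi_k t} - e^{i\xi_{k-1} t}.
\]
% For \eta > 0 the map s \mapsto \xi + \eta s is increasing, so
\[
  s_0 < s_1 < \dots < s_K
  \quad\Longrightarrow\quad
  \xi_0 < \xi_1 < \dots < \xi_K ,
\]
% and the suprema over \xi and over increasing (s_k) are dominated by a single supremum
% over increasing frequencies (\xi_k), uniformly in \eta.
```

For negative $\eta$ the sequence $\xi_k$ is decreasing instead, and reversing its order gives the same bound, which is why that case is similar.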

Remark D.3. To prove the Lacey-Terwilleger theorem D.1 from the mollified version,
one may approximate the characteristic functions used as cutoff functions by Wiener
space functions so that the difference is small in norm. Then at least for $f$ in $L^\infty$ one
can show convergence of the limits by an approximation argument, even though one will
not recover the full strength of the quantitative estimate in the Wiener space setting.
The result for $f$ in $L^\infty$ can then be used as a dense subclass result in other spaces,
which can be handled by easier maximal function estimates and further approximation
arguments.

Remark D.4. The classical version of the Wiener-Wintner theorem does not invoke
singular integrals but more classical averages of the type
\[ \frac{1}{2s} \int_{|t| < s} e^{i\theta t} f(T_t x)\,dt. \]
We note that the same technique as above may be applied to these easier averages.